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NOVEL TYROSINE KINASE 

5 

BACKGROUND 

The ability of cells to respond to environmental cues is in large part due to the 
interaction of cell-surface receptors with external stimuli. A number of factors that 
interact with receptors have been identified, among which are receptor- binding proteins 

10 that are soluble, membrane-bound, or exist in both forms. 

One class of receptors, the receptor tyrosine kinases (RTKs), has been 
intensively studied and shown to be crucial to the growth and differentiation of a variety 
of cell types (Yarden and Ullrich, Ann. Rev. Biochem. 57:433-478, 1988). Tyrosine 
kinases are enzymes that catalyze the phosphorylation of tyrosine residues. Tyrosine 

15 phosphorylation is associated with signal-transduction across the cellular plasma 
membrane. The protein-tyrosine kinase family can be grouped into two very broad 
families: the above-mentioned RTKs, which are, or are intimately associated with, 
membrane-spanning growth factor receptors; and those that are associated with the 
membrane but lack a transmembrane sequence (Yarden and Ullrich, 1988, supra.). 

20 The RTKs can be further divided into five subgroups on the basis of structural 

similarities in their extracellular domains and the organization of the tyrosine kinase 
catalytic region in their cytoplasmic domains. Subgroups I (epidermal growth factor 
(EGF) receptor-like ), II (insulin receptor-like) and the ephleck family contain cysteine- 
rich sequences (Hirai et aL, Science 238:1717-1720, 1987; Yanden and Ullrich, Ann. 

25 Rev. Biochem, 57,443-478, 1988; Lindberg and Hunter, Mol Cell Biol 10:6316- 
6324, 1990). The functional domains of the kinase region of these three classes of 
RTKs are encoded as a contiguous sequence (Hanks et aL, Science 241, 42-52, 
1988). Subgroups III (platelet- derived growth factor (PDGF) receptor- like) and IV 
(the fibroblast growth factor (FGF) receptors) are characterized as having 

30 immunoglobulin (Ig)-like folds in their extracellular domains, as well as having their 
kinase domains divided in two parts by a variable stretch of unrelated amino acids 
(Yanden and Ullrich, 1988; Hanks et aL, 1988). 

While all members of the RTK family share a related cytoplasmic catalytic 
domain, the extracellular, ligand-binding domains of these receptors have adapted 

35 patchwork structure utilizing several structural motifs. The variablility in the structure 
of the ligand-binding domains of the RTKs almost certainly reflects the diversity of the 
ligands for these receptors (Ullrich and Schlessinger, Cell 61, 243-254, 1990). These 
lijgands range from relatively small, soluble peptides to cell surface proteins that 
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themselves resemble receptors. Examples of ligands bound by certain members of the 
RTK family are polypeptide growth factors and hormones. 

To deal with this diversity of ligands the RTKs have evolved extracellular 
domains that are a composite of several structural motifs. For example, the 
5 extracellular domain of the axllark gene contains both Ig-domains and fibronectin type 
III (FNIII) repeats (O'bryan et aL, MoL Cell Biol 11:5016-5031, 1991; Rescigno et 
al., Oncogene 6:1909-1913, 1991), while members of the eph family have those two 
motifs separated by a (non-EGF-like) cysteine-rich domain (Hirai et al., Science 
238:1717-1720, 1987; Lindberg et al., MoL Cell Biol 10:6316-6324, 1990; Lhotak et 

10 aL, MoL Cell Biol 1 1:2496-2502, 1991; Chan and Watt, Oncogene 6:1057- 

1061,1991). This diversity strongly suggests that this family of receptors evolved by 
accumulating the structural motifs needed for ligand binding and combining these 
motifs with a conserved catalytic domain. 

In view of the role tyrosine kinases play in cell growth and differentiation, as 

15 well as signal transduction, isolation of novel tyrosine kinases enables one to study 
such biological processes. Identifying tyrosine kinases and their ligands also permits 
exploration of methods for inhibiting or enhancing the interaction thereof, depending on 
the desired biological effect. 

SUMMARY OF INVENTION 
20 The present invention provides a novel receptor (a protein tyrosine kinase), 

isolated DNA encoding the tyrosine kinase, recombinant expression vectors containing 
the isolated DNA, and host cells transformed with the recombinant vector. Also 
provided is a method for producing the novel protein by cultivating the transformed 
host cells under conditions that promote expression of the tyrosine kinase, and 
25 recovering the expressed tyrosine kinase. 

BRIEF DESCRIPTION OF FIGURES 
Figure 1 presents a nucleotide sequence and deduced amino acid sequence of 
human ork cDN A. The nucleotide sequence is derived from two overlapping cDNA 
clones isolated from placental cDNA libraries. The initiating methionine codon and the 

30 transmembrane region are each underlined. The two cysteines of the immunoglobulin 
domain are boxed, as are the three EGF-like repeats. Brackets enclose the region 
amplified by PCR using primers based on conserved kinase domain sequences, as 
described in example 1. Numbering of nucleotides is in the left margin, while 
numbering of amino acids is on the right. The signal peptide comprises amino acids 1- 

35 18, with the threonine residue at position 19 being the first amino acid of the mature 
protein. The extracellular domain comprises amino acids 19-745, and the cytoplasmic 
domain comprises amino acids 773-1 124. 
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Figure 2 presents an analysis of the EGF repeats in the ork sequence. The 
three EGF- repeats in the ork sequence were aligned with those in the tan-l and laminin 
Bl genes. The conserved cysteine residues are boxed and other conserved residues 
(conserved in two ork repeats and at least one other repeat) are marked with an asterisk. 
5 Figure 3 presents the results of studies described in example 2 below. The 

human ork cDNA encodes a 140 Kd phosphoprotein with in vitro tyrosine kinase 
activity. A. COS cells were transfected with the ork cDNA {ork), or a control plasmid 
without cDNA insert (mock), metabolically labelled with 35s-met/cys, and 
immunoprecipitated with the PI rabbit antiserum, raised against the carboxy-terminal 

10 21 amino acids of the predicted ork amino acid sequence (PI), or with pre-immune 

serum (C). The position of P140 or ^ is indicated, as are the positions of size standards. 
B. COS cells were transfected and immunoprecipitated as described above. The 
immunoprecipitates were incubated with 32p_ATP and separated by SDS-PAGE. The 
positions of PMO 0 ^ and size standards are indicated. C. The gel from Panel B was 

15 transferred to a membrane, exposed to X-ray film and the band corresponding to 

P\40° r k was excised. The filter slice was boiled in 6N HC1 and eluted material was 
separated by two-dimensional electrophoresis. The positions of standards for 
phosphoserine, phosphothreonine and phosphotyrosine, as well as the origin, are 
indicated. D. COS cells were transfected and immunoprecipitated as described above. 

20 The immunoprecipitates were separated by SDS-PAGE, transferred to a membrane and 
incubated with an anti-phosphotyrosine antibody. The positions of size standards are 
indicated. 

Figure 4 shows the results of a study of expression of ork mRNA in human 
tissues, as described in example 3 below. Two micrograms of poly(A)-containing 
25 RNA from the indicated tissues was hybridized with an 32p_uTP-labelled RNA probe 
from the ork cDNA. The positions of size standards are marked. 

Fi gures 5a and 5b depict a comparison of the amino acid sequences of ork 
(top lines) and tie (bottom lines), the latter being a tyrosine kinase of the same subgroup 
as ork. The sequence comparison was generated by the GAP computer program 
30 described. Identical amino acids are indicated with a line; conservative or similar 

amino acid changes are indicated with two dots or one dot, respectively. The percent 
similarity and percent identity for the ork and tie sequences were found to be 64.5% 
and 47.5%, respectively. 

Fi gure 6 presents a comparison of the amino acid sequence encoded by a 
35 murine ork clone with the corresponding portion of a human ork protein. 
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DETAILED DESCRIPTION OF INVENTION 
The present invention provides a novel receptor protein tyrosine kinase, isolated 
DNA encoding the tyrosine kinase, recombinant expression vectors containing the 
5 isolated DNA, and host cells transformed with the recombinant vector. A method for 
producing the novel protein involves cultivating the transformed host cells under 
conditions that promote expression of the tyrosine kinase, and recovering the expressed 
tyrosine kinase from the cell culture. 

The novel protein is designated herein as ork (orphan receptor tyrosine kinase). 

10 An ork cDNA isolated from human placenta and transfected into COS-7 cells encodes a 
140Kd protein with the following combination of structural motifs in its extracellular 
domain: an immunoglobulin (Ig)-like domain followed by three epidermal growth 
factor (EGF)-like cysteine-rich repeats, which in turn are followed by three fibronectin 
type III (FNIII) repeats. Of the human tissues tested (example 3 below), the ork gene 

15 is expressed predominately in placenta and lung, with lower levels in umbilical vein 
endothelial cells, brain and kidney. 

Human ork is within the scope of the present invention, as are ork proteins 
derived from other mammalian species. As used herein, the term 19 ork" includes 
membrane-bound proteins (comprising a cytoplasmic domain, a transmembrane region, 

20 and an extracellular domain) as well as truncated proteins that retain the desired 
biological properties. Such truncated proteins include, for example, soluble ork 
comprising only the extracellular (ligand binding) domain. 

Using a polymerase chain reaction-based approach we have isolated ork cDNA 
and characterized this novel receptor protein tyrosine kinase. As described in example 

25 1 below, degenerate oligonucleotide probes based on certain sequences that are 

conserved within the kinase domain of RTKs were used as primers in a PCR reaction. 
Single-stranded cDNA derived from human placental poly(A) + RNA was employed as 
the template. The PCR reaction products were inserted into a cloning vector and the 
nucleotide sequence of the cDNA inserts in a number of the resulting recombinant 

30 vectors was determined. A clone containing a cDNA insert of about 200 basepairs with 
a novel DNA sequence was identified. 

The cDNA insert was excised, radiolabeled, and used to probe an oligo-dT- 
primed human placental cDNA library. A hybridizing clone comprising a 2.3 Kb insert 
was identified, sequenced, and found to be a partial clone encoding a C-terminal ork 

35 fragment. The cDNA insert was excised, radiolabeled, and used to probe a random- 
primed human placental cDNA library. A clone comprising a cDNA insert about 4.0 
Kb in length that contained an entire coding region for the novel protein, as well as 5* 
and 3' untranslated sequences, was isolated. The 4.0 kb ork cDNA in plasmid 
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pBLUESCRIPT®SK in E. coli strain DH5a was deposited with the American Type 
Culture collection on May 28, 1992 and was given accession number ATCC 69003. 
The strain deposit was made under the terms of the Budapest Treaty. 

A DNA sequence for a human ork cDNA is presented in Figure 1, along with 
5 the amino acid sequence encoded thereby. Human ork DNA and amino acid sequences 
identical to those of Figure 1 are presented in SEQ ID NO:l and SEQ ID NO:2. 
Several features of the ork amino acid sequence are noteworthy. The extracellular 
domain is a patchwork of three structural motifs. Between amino acid residues 21 1 and 
340 are three copies of an EGF-like cysteine repeat (Davis, New Biologist 2:410-419, 

10 1991). The EGF-like repeats in the ork gene (boxed in Figure 1) differ from the 

consensus motif in that they have 8 cysteines each instead of 6. In this regard they are 
closely related to those in the tie gene (Partanen et al., Mol Cell Biol, 12:1698-1707, 
1992). The proteins with the next most closely related cysteine repeats are laminin Bl, 
laminin B2 (Sasaki et al., J. Biol Chem 263:16536-16544, 1988) and TAN, the 

15 human homolog of the Drosophila notch gene (Ellison et al, Cell 66:649-661). The 
three EGF-repeats in the ork sequence were aligned with those in the tan-\ and laminin 
Bl genes in Figure 2. In each case the cysteines in the repeat were aligned. 

Amino terminal to the EGF repeats are paired cysteines (boxed in Figure 1) 
indicative of Ig-like domains. This Ig domain is most similar to the C2 type as 

20 described by Williams and Barclay, Ann. Rev. Immunol 6:381-405, 1988. A second 
pair of cysteine residues follow the EGF repeats; however the additional amino acid 
residues required to generate a proper Ig-fold are missing. At the membrane proximal 
region of the extracellular domain, between amino acids 440 and 733, are three repeats 
of the fibronectin type III motif. Thus, the extracellular domain of ork is a structural 

25 mosaic consisting of repeats of at least three different functional motifs. 

Amino acid residues 746-772 are hydrophobic in nature and likely serve as a 
transmembrane domain. The cytoplasmic domain contains all the amino acid sequence 
hallmarks of a tyrosine kinase (Hanks et al., Science 241:42-52, 1988). A comparison 
of the ork amino acid sequence with known protein tyrosine kinases showed ork to be a 

30 member of the RTK family. 

Within the receptor tyrosine kinase family, the ork and above-mentioned tie 
genes form their own subgroup. The tie gene (described by Partanen et al., supra) has 
an extracellular domain that, like ork, comprises (from N- to C-terminus) an Ig-like 
domain followed by three EGF-like domains, an incomplete Ig-like domain, and three 

35 fibronectin type III repeats next to the transmembrane region. Additionally, a cDNA 
form lacking the first of the three epidermal growth factor homology domains was 
isolated by Partanen et al., supra, suggesting that alternative splicing creates different 
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r/e-type receptors. COS-7 cells transfected with a tie cDNA expression vector produced 
a glycosylated protein of 1 1 7 kDa. 

A comparison of the amino acid sequences of ork and tie is presented in Figures 
5a and 5b, which shows them to be distinct proteins. The comparison presented in 
5 Figures 5a and 5b was generated using the GAP computer program, which is described 
in detail below. The percent similarity and percent identity of the ork and tie amino acid 
sequences was found to be 64.5% and 47.5%, respectively. The RTK subgroup 
containing ork and tie is characterized by a unique arrangement of structural motifs in 
the proteins' extracellular domain, as described above. There are also several features 

10 in the cytoplasmic kinase domain worth noting. There is a short amino acid insert 
between the two pans of the kinase domains (Fig. 1). This insert is similar in size to 
that seen in the ret gene (Takahashi and Cooper, MoL Cell. Biol. 7:1378-1385, 1987). 
However, unlike the insert sequences in other RTK subgroups, the insert in the ork and 
tie genes contains no tyrosine residues. Several tyrosine residues are found in the 

15 cytoplasmic tail. As phosphorylation of tyrosine in either the kinase insert or the 

cytoplasmic tail is required for association of the RTK molecule with putative signal 
transduction molecules, this feature of the ork and tie genes mostly closely resembles 
the EGFR subfamily where phosphorylated tyrosine residues in the cytoplasmic tail 
have been shown to associate with other proteins such as phospholipase C-y (Margolis 

20 etal., Cell 57:1 101-1 107, 1989). 

One embodiment of the present invention provides an ork protein comprising an 
extracellular domain that comprises (from N- to C-terminus) an immunoglobin-like 
domain, three EGF-like cysteine-rich repeats, and three FNIII repeats, wherein the 
amino acid sequence of said extracellular domain is at least 80% identical to the 

25 sequence shown as amino acids 1 9-745 of SEQ ID NO: 1 . The extracellular domain 
amino acid sequence preferably is at least 90% identical to the extracellular domain 
sequence of SEQ ID NO: 1 . The percent identity between two amino acid sequences 
may be determined by using the GAP computer program available from the University 
of Wisconsin and described in detail below. Such ork proteins include those having a 

30 transmembrane region and a cytoplasmic domain (or a portion thereof) in addition to the 
extracellular domain. Such proteins also include those in which a sequence containing 
a pair of cysteines is positioned between the last EGF-like repeat and the first FN III 
repeat. Such a sequence (a stretch of amino acids resembling an incomplete Ig-like 
domain) is found in the native human ork protein described above. Since this sequence 

35 is not believed to function as an Ig-like domain, it is possible that ork proteins lacking 
some or all of this partial Ig-like domain (amino acids 341-439) will retain the ability to 
bind a ligand. The ork proteins when initially synthesized may comprise a signal 
peptide as well. 
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The expression pattern of the ork gene suggests that it is predominately 
expressed in endothelial cells. The structure of the extracellular domain, with the three 
EGF repeats, is especially intriguing in light of this. Several cell adhesion molecules, 
including endothelial-leukocyte adhesion molecule- 1 (ELAM-1) contain EGF repeats. 
5 ELAM-1 is expressed on the surface of activated endothelial cells and is involved in the 
attachment of neutrophils at sites of inflammation (Bevilacqua et al., Science 
243:1150-1 165, 1989; Siegelman et al., Cell 61:61 1-622, 1990). Several adhesion 
molecules also contain Ig and FNIII repeats as well. This suggests a possible role for 
the ork gene product in the communication between endothelial cells and leukocytes at 

10 sites of inflammation. cDNA encoding an ork polypeptide may be isolated from other 
mammalian species by procedures analogous to those employed in isolating the human 
ork clone. For example, a cDNA library derived from another mammalian species may 
be substituted for the human cDNA library that was screened using the degenerate 
probes in example 1 below. Alternatively (and preferably), the human ork cDNAs 

15 isolated in example 1 are labeled and used as probes to screen mammalian cDNA or 
genomic libraries using cross-species hybridization techniques. The probe may be 
derived from the coding region of the above-described 2.3kb or 4.0 kb human ork 
cDNAs. 

Murine ork cDNA was identified by cross-species hybridization. A murine 

20 cDNA library was screened using the 2.3 Kb human ork cDNA as a probe, as 
described in example 5. 

Cell types from which cDNA and genomic libraries may be prepared include 
those in which ork RNA expression was detected in example 3. mRNAs isolated from 
various cell lines can be screened by Northern hybridization to determine additional 

25 suitable sources of mammalian ork mRNA for use in cloning an ork gene. Nucleic acid 
from mammalian sources that include but are not limited to murine, bovine, porcine, 
and primate, may be screened to identify ork genes. 

In addition to the membrane-bound full length protein depicted in SEQ ID 
NO:l, the present invention provides soluble forms of the ork protein. "Soluble ork" 

30 as used in the context of the present invention refers to polypeptides that contain all or 
pan of the extracellular region of a native ork and that, due to the absence of a 
transmembrane region that would cause retention of the polypeptide on a cell 
membrane, are secreted upon expression. Fragments of the extracellular domain may 
be employed as long as the fragment possesses the desired biological activity (e.g., 

35 binding to an 'dmi~ork antibody or to the ligand for ork). Soluble ork may also include 
part of the transmembrane region or pan of the cytoplasmic domain or other sequences, 
provided that the soluble ork protein is capable of being secreted. Preferred soluble ork 
polypeptides include the signal sequence and entire extracellular domain (amino acids 1 
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to 745 of SEQ ID NO: 1) or lack the signal sequence but contain the entire extracellular 
domain (amino acids 19 to 745 of SEQ ID NO: 1). 

Soluble ork polypeptides may be identified (and distinguished from their non- 
soluble membrane-bound counterparts) by separating intact cells which express the 
5 protein in question from the culture medium, e.g., by centrifugation, and assaying the 
medium (supernatant) for the presence of ork. The presence of ork in the medium 
indicates that the protein was secreted from the cells and thus is a soluble form. 
Soluble ork may be naturally-occurring forms of these proteins, such as those resulting 
from alternative splicing events. Alternatively, soluble fragments of ork proteins may 
10 be produced by recombinant DNA technology or otherwise isolated, as described 
below. 

The use of soluble forms of ork is advantageous for certain applications. 
Purification of the proteins from recombinant host cells is facilitated, since the soluble 
proteins are secreted from the cells. The smaller soluble fragments may be 

15 advantageous for use in certain in vitro assays. The soluble ork polypeptides may be 
employed to competitively bind the ligand in vivo, thus inhibiting signal transduction 
activity via endogenous cell surface bound ork proteins. Further, soluble proteins are 
generally more suitable for intravenous administration and may exert their desired effect 
(e.g., binding a ligand) in the bloodstream. 

20 Truncated ork proteins, including soluble polypeptides, may be prepared by any 

of a number of conventional techniques. In the case of recombinant proteins, a DNA 
fragment encoding a desired ork fragment may be subcloned into an expression vector. 
A desired DNA sequence may be chemially synthesized using known techniques. 
DNA fragments also may be produced by restriction endonuclease digestion of a full 

25 length cloned DNA sequence, and isolated by electrophoresis on agarose gels. Linkers 
containing restriction endonuclease cleavage site(s) may be employed to insert the 
desired DNA fragment into an expression vector, or the fragment may be digested at 
cleavage sites naturally present therein. 

In another approach, enzymatic treatment (e.g., using Bal 31 exonuclease) may 

30 be employed to delete terminal nucleotides from a DNA fragment to obtain a fragment 
having a particular desired terminus. Among the commercially available linkers are 
those that can be ligated to the blunt ends produced by Bal 31 digestion, and which 
contain restriction endonuclease cleavage site(s). Alternatively, oligonucleotides that 
reconstruct the N- or C-terminus of a DNA fragment to a desired point may be 

35 synthesized. The oligonucleotide may contain a restriction endonuclease cleavage site 
upstream of the desired coding sequence and position an initiation codon (ATG) at the 
N-terminus of the coding sequence. The well known polymerase chain reaction 
procedure also may be employed to amplify a DNA sequence encoding a desired 
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protein fragment. 3' and 5' oligonucleotide primers that anneal to the ork DNA at the 
termini of a desired fragment are employed in the PCR reaction which is conducted 
using any suitable procedure, such as those described in Sarki et al M Science 239:487 
(1988); in Recombinant DNA Methodology, Wu et al., eds., Academic Press Inc., San 
5 Diego ( 1 989), pp. 1 89- 1 96; and in PCR Protocols: A Guide to Methods and 

Applications, Innis et al., eds., Academic Press, Inc. (1990). An example of a suitable 
PCR procedure is as follows. All temperatures are in degrees centigrade. The 
following PCR reagents are added to a 0.5 ml Eppendorf microfuge tube: 10 pi of 10X 
PCR buffer (500 mM KC1, 100 mM Tris-HCl, pH 8.3 at 25°C, 25 mM MgCl 2 , and 1 

10 mg/ml gelatin) (Perkin-Elmer Cetus, Norwalk, CN), 8^1 of a 2.5 mM solution 

containing each dNTP (2 mM dATP, 2 mM dCTP, 2 mM dGTP and 2 mM dTTP), 2.5 
units (0.5 fil of standard 5000 units/ml solution) of Taq DNA polymerase (Perkins- 
Elmer Cetus), 1 ng of template DNA, 100 picomoles of each of the oligonucleotide 
primers, and water to a final volume of 100 jaL The final mixture is then overlaid with 

15 100 nl parafin oil. PCR is carried out using a DNA thermal cycler (Ericomp, San 

Diego, CA). The template is denatured at 94° for 5 minutes and PCR is carried out for 
25 cycles of amplification using a step program (denaturation at 94°, 1.5 minutes; 
annealing at 60% 1 minute; extension at 72°, 1 minute). 

The present invention also provides full length ork protein or antigenic 

20 fragments thereof that can act as immunogens to generate antibodies specific to the ork 
immunogens. Monoclonal antibodies specific for ork or antigenic fragments thereof are 
prepared by prodedures that include those described in Example 4. The above- 
described procedures for producing ork fragments may be employed in producing ork 
fragments for use as immunogens. 

25 

Expression of Recombinant ork Proteins 

The present invention provides recombinant expression vectors to express DNA 
encoding the ork proteins of the present invention. The inventive recombinant 
expression vectors are replicable DNA constructs which contain a synthetic or cDNA- 

30 derived DNA sequence encoding an ork protein, operably linked to suitable 

transcriptional or translational regulatory elements. Examples of genetic elements 
having a regulatory role in gene expression include transcriptional promoters, operators 
or enhancers, a sequence encoding suitable mRNA ribosomal binding sites, and 
appropriate transcription and translation initiation and termination sequences. The 

35 ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants may additionally be 
incorporated. The regulatory elements employed in the expression vectors are generally 
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derived from mammalian, microbial, viral, or insect genes. Expression vectors derived 
from retroviruses also may be employed. 

DNA regions are operably linked when they are functionally related to each 
other. A DNA sequence encoding ork is said to be operably linked to one or more of 
5 the above-described regulatory elements when the ork DNA sequence is transcribed, or 
the resulting mRNA is translated, under the control of the regulatory element(s). 

Transformed host cells are cells which have been transformed or transfected 
with foreign DNA using recombinant DNA techniques. In the context of the present 
invention, the foreign DNA includes a sequence encoding the inventive ork protein. 

10 Host cells may be transformed for purposes of cloning or amplifying the foreign DNA, 
or may be transformed with an expression vector for production of the fusion protein 
under the control of appropriate promoters. Suitable host cells include prokaryotes, 
yeast, or higher eukaryotic cells. Appropriate cloning and expression vectors for use 
with bacterial, fungal, yeast, and mammalian cellular hosts are described by Pouwels et 

1 5 al. (Cloning Vectors: A Laboratory Manual, Elsevier, New York, 1985), the relevant 
disclosures of which is hereby incorporated by reference. Cell-free translation systems 
could also be employed to produce fusion protein using RNAs derived from the DNA 
constructs of the present invention. 

Prokaryotes include gram negative or gram positive organisms. Prokaryotic 

20 expression vectors generally comprise one or more phenotypic selectable markers, for 
example a gene encoding proteins conferring antibiotic resistance or supplying an 
autotrophic requirement, and an origin of replication recognized by the host to ensure 
amplification within the host. Examples of suitable prokaryotic hosts for 
transformation include £. coli, bacilli such as Bacillus subtilis, Salmonella 

25 typhimurium, and various species within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a matter of choice. 

Useful expression vectors for bacterial use can comprise a selectable marker and 
bacterial origin of replication derived from commercially available plasmids comprising 
genetic elements of the well-known cloning vector pBR322 (ATCC 37017). Such 

30 commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 
Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. E. coli is typically transformed using derivatives 
of pBR322, a plasmid derived from an E. coli species (Bolivar et al., Gene 2:95, 

35 1977). pBR322 contains genes for ampicillin and tetracycline resistance, providing 
simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include 
the B-lactamase (penicillinase) and lactose promoter system (Chang et al., Nature 
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275:615, 1978; and Goeddel et al., Nature 287:544, 1979), the tryptophan (trp) 
promoter system (Goeddel et al., Nuci Acids Res. 5:4057, 1980; and EPA 36,776) 
and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system 
5 employs the phage X Pl promoter and cI857ts thermoinducible repressor. Plasmid 
vectors available from the American Type Culture Collection which incorporate 
derivatives of the X Pl promoter include plasmid pHUB2, resident in E. coli strain 
JMB9 (ATCC 37092) and pPLc28, resident in E. coli RR1 (ATCC 53082). 

The recombinant ork protein may also be expressed in yeast hosts, preferably 

10 from Saccharomyces species, such as S. cerevisiae. Yeast of other genera such as 
Pichia or Kluyveromyces may also be employed. Yeast vectors will generally contain 
an origin of replication from the 2\xm yeast plasmid or an autonomously replicating 
sequence (ARS), a promoter, DNA encoding the ork protein, sequences for 
polyadenylation and transcription termination and a selection gene. Yeast vectors may 

15 include origins of replication and selectable markers permitting transformation of both 
yeast and E. coli, e.g., the ampicillin resistance gene of E. coli and the S. cerevisiae 
trpl gene. The trp 1 gene provides a selection marker for a mutant strain of yeast 
lacking the ability to grow in tryptophan, and a promoter derived from a highly 
expressed yeast gene to induce transcription of a structural sequence downstream. The 

20 presence of the trpl lesion in the yeast host cell genome then provides an effective 
environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 
metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., /. Biol. Chem. 255:2073, 
1980) or other glycolytic enzymes (Hess et aL, 7. Adv. Enzyme Reg, 7:149, 1968; and 

25 Holland et al., Biochem. 77:4900, 1978), such as enolase, glyceraldehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucoses- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase and glucokinase. Examples of suitable vectors 
and promoters for use in yeast expression are further described in R. Hitzeman et al., 

30 EPA 73,657. 

Preferred yeast vectors can be assembled using DNA sequences from pBR322 
for selection and replication in E. coli (Amp 1 * gene and origin of replication) and yeast 
DNA sequences including a glucose-repressible ADH2 promoter and a-factor 
secretion leader. The ADH2 promoter has been described by Russell et al. (7. Biol 
35 Chem. 255:2674, 1982) and Beier et al., {Nature 500:724, 1982). Advantageously, a 
DNA segment encoding a leader sequence functional in yeast is operably linked to the 
5' end of the DNA encoding the ork protein. The encoded leader peptide promotes 
secretion of the ork protein from the host cell and is generally cleaved from the ork 
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protein upon secretion. As one example, the yeast a-factor leader, which directs 
secretion of heterologous proteins, can be inserted between the promoter and the 
structural gene to be expressed. See, e.g., Kurjan et ah, Cell 50:922, 1982; and Bitter 
et al., Proc. Natl. Acad, Sci. USA 87:5330, 1984. The leader sequence may be 
5 modified to contain, near its 3' end, one or more useful restriction sites to facilitate 
fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to those of skill in the art. 
An exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 
75:1929, (1978), selecting for Trp+ transformants in a selective medium consisting of 
10 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10|Hg/ml adenine and 
20 |ig/ml uracil. Host strains transformed by vectors comprising the above-described 
ADH2 promoter may be grown for expression in a rich medium consisting of 1% yeast 
extract, 2% peptone, and 1 % glucose supplemented with 80 jig/ml adenine and 80 
(ig/ml uracil. Derepression of the ADH2 promoter occurs upon exhaustion of medium 
15 glucose. Crude yeast supernatants are harvested by filtration and held at 4°C prior to 
further purification. 

Various mammalian or insect cell culture systems can be employed to express 
recombinant protein. Baculovirus systems for production of heterologous proteins in 
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:41 (1988). 
20 Established cell lines of mammalian origin may be employed. Examples of suitable 

mammalian host cell lines include the COS-7 line of monkey kidney cells (ATCC CRL 
1651), described by Gluzman (Cell23:175 y 1981), CV-1 cells (ATCC CCL 70) also 
derived from monkey kidney, L cells, C127, 3T3, Chinese hamster ovary (CHO), 
HeLa and BHK cell lines. Mammalian expression vectors may comprise non- 
25 transcribed elements such as an origin of replication, a suitable promoter and enhancer 
linked to the gene to be expressed, and other 5* or 3' flanking nontranscribed 
sequences, and 5' or 3' nontranslated sequences, such as necessary ribosome binding 
sites, a poly-adenylation site, splice donor and acceptor sites, and transcriptional 
termination sequences. 

30 The transcriptional and translational control sequences in expression vectors to 

be used in transforming vertebrate cells may be provided by viral sources. For 
example, commonly used promoters and enhancers are derived from Polyoma, 
Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences 
derived from the SV40 viral genome, for example, SV40 origin, early and late 

35 promoter, enhancer, splice, and polyadenylation sites may be used to provide the other 
genetic elements required for expression of a heterologous DNA sequence. The early 
and late promoters are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the SV40 viral origin or replication (Fiers et al., 
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Nature 273: 1 13, 1978). Smaller or larger SV40 fragments may also be used, provided 
the approximately 250 bp sequence extending from the Hind III site toward the Bgll 
site located in the viral origin of replication is included. Exemplary vectors can be 
constructed as disclosed by Okayama and Berg (Mol Cell Biol 3;280, 1983). A 
5 useful system for stable high level expression of mammalian receptor cDNAs in CI 27 
murine mammary epithelial cells can be constructed substantially as described by 
Cosman et al. (Mol Immunol 25:935, 1986). Other expression vectors for use in 
mammalian host cells are derived from retroviruses. 



10 Producing and Purifying the ork Protein 

The present invention provides substantially homogeneous ork protein, which 
may be produced by recombinant expression systems as described above or purified 
from naturally occurring cells. The ork protein is purified to substantial homogeneity, 
as indicated by a single protein band upon analysis by SDS-polyacrylamide gel 

1 5 electrophoresis (SDS-PAGE). 

In one embodiment of the present invention, ork is purified from a cellular 
source using any suitable protein purification technique. The tissues identified in 
example 3 as containing ork RNA (preferably placental or lung tissue) from a 
mammalian species of interest may be employed as sources of ork, for example. 

20 An alternative process for producing the recombinant ork protein of the present 

invention comprises culturing a host cell transformed with an expression vector 
comprising a DNA sequence that encodes said ork protein under conditions that 
promote expression of the ork protein, which is then purified from culture media or cell 
extracts. Any suitable purification process may be employed, with the procedure of 

25 choice varying according to such factors as the type of host cells and whether or not the 
desired protein is secreted from the host cells. The fusion protein will be secreted into 
the culture medium when it is initially fused to a signal sequence or leader peptide 
operative in the host cells, or when the protein comprises soluble forms of the ork 
polypeptides. 

30 For example, supernatants from expression systems which secrete recombinant 

protein into the culture medium can be first concentrated using a commercially available 
protein concentration filter, e.g., an Amicon or Millipore Pellicon ultrafiltration unit. 
Following the concentration step, the concentrate can be applied to a suitable 
purification matrix. For example, an immunoaffinity column comprising antibodies 

35 directed against ork and bound to a suitable support may be employed. A monoclonal 
antibody specific for ork may be prepared as described in example 4. Alternatively, an 
anion exchange resin can be employed, for example, a matrix or substrate having 
pendant diethylaminoethyl (DEAE) groups. The matrices can be acrylamide, agarose, 
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dextran, cellulose or other types commonly employed in protein purification. 
Alternatively, a cation exchange step can be employed. Suitable cation exchangers 
include various insoluble matrices comprising sulfopropyl or carboxymethyl groups. 
Sulfopropyl groups are preferred. One or more reversed-phase high performance 
5 liquid chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media, 
e.g., silica gel having pendant methyl or other aliphatic groups, can be employed to 
further purify ork. 

Recombinant protein produced in bacterial culture is usually isolated by initial 
extraction from cell pellets, followed by one or more concentration, salting-out, 

10 aqueous ion exchange or size exclusion chromatography steps. Finally, high 

performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of recombinant fusion proteins can 
disrupted by any convenient method, including freeze-thaw cycling, sonication, 
mechanical disruption, or use of cell lysing agents. 

15 Fermentation of yeast which express ork as a secreted protein greatly simplifies 

purification. Secreted recombinant protein resulting from a large-scale fermentation can 
be purified by methods analogous to those disclosed by Urdal et al. (J. Chromatog. 
296:111, 1984), involving two sequential, reversed-phase HPLC steps for purification 
of a recombinant protein on a preparative HPLC column. 

20 Some or all of the foregoing purification steps, in various combinations, can be 

employed to provide an essentially homogeneous recombinant protein. Recombinant 
cell culture enables the production of the ork protein free of those contaminating 
proteins which may be normally associated with ork as it is found in nature, e.g., in 
cells, cell exudates or body fluids. The foregoing purification procedures are among 

25 those that may be employed to purify non-recombinant ork proteins of the present 
invention as well. 

Variants and Derivatives of ork 

Variants and derivatives of native ork proteins that retain the desired biological 
30 activity are also within the scope of the present invention. An ork variant, as referred 
to herein, is a polypeptide substantially homologous to a native ork, but which has an 
amino acid sequence different from that of native ork (human, murine or other 
mammalian species) because of one or a plurality of deletions, insertions or 
substitutions. 

35 The variant amino acid sequence preferably is at least 80% identical to a native 

ork amino acid sequence, most preferably at least 90% identical. When the variant ork 
protein comprises extracellular, transmembrane and cytoplasmic domains, these percent 
identities apply to the entire sequence and also to the extracellular domain when taken 
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alone. Since the cytoplasmic domains are relatively conserved among members of the 
RTK family, the percent identity of the extracellular domain is important in identifying a 
variant as an ork protein. 

To illustrate this point, a comparison of the full length ork and tie amino acid 
5 sequences (aligned in Figure 5) reveals 76% identity for the cytoplasmic domains, 
whereas the percent identity drops to 47.5% for the full length sequences as a whole. 
The percent identity drops to 33% when just the extracellular domains are compared. A 
DNA probe corresponding to the extracellular region of tie would not hybridize to ork 
DNA under moderately stringent hybridization conditions. 
10 The degree of homology (percent identity) may be determined, for example, by 

comparing sequence information using the GAP computer program, version 6.0 
described by Devereux et al. (Nucl. Acids Res. 12:387, 1984) and available from the 
University of Wisconsin Genetics Computer Group (UWGCG). The GAP program 
utilizes the alignment method of Needleman and Wunsch (J. MoL Biol. 48:443, 
15 1970), as revised by Smith and Waterman (Adv. AppL Math 2:482, 1981). Briefly, 

the GAP program defines similarity as the number of aligned symbols (i.e., nucleotides 
or amino acids) which are similar, divided by the total number of symbols in the shorter 
of the two sequences. The preferred default parameters for the GAP program include: 
(1) a unary comparison matrix (containing a value of 1 for identities and 0 for non- 
20 identities) for nucleotides, and the weighted comparison matrix of Gribskov and 

Burgess, Nucl. Acids Res. 14:6745, 1986, as described by Schwartz and Dayhoff, 
eds., Atlas of Protein Sequence and Structure, National Biomedical Research 
Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for each gap and an additional 
0.10 penalty for each symbol in each gap; and (3) no penalty for end gaps. 
25 Alterations of the native amino acid sequence may be accomplished by any of a 

number of known techniques. Mutations can be introduced at particular loci by 
synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites 
enabling ligation to fragments of the native sequence. Following ligation, the resulting 
reconstructed sequence encodes an analog having the desired amino acid insertion, 
30 substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can 
be employed to provide an altered gene having particular codons altered according to 
the substitution, deletion, or insertion required. Exemplary methods of making the 
alterations set forth above are disclosed by Walder et al. (Gene 42: 133, 1986); Bauer et 
35 al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985, 12-19); Smith et al. 

(Genetic Engineering: Principles and Methods, Plenum Press, 1981); and U.S. Patent 
Nos. 4,518,584 and 4,737,462, which are incorporated by reference herein. 
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Variants may comprise conservatively substituted sequences, meaning that a 
given amino acid residue is replaced by a residue having similar physiochemical 
characteristics. Examples of conservative substitutions include substitution of one 
aliphatic residue for another, such as He, Val, Leu, or Ala for one another, or 
5 substitutions of one polar residue for another, such as between Lys and Arg; Glu and 
Asp; or Gin and Asn. Other such conservative substitutions, for example, substitutions 
of entire regions having similar hydrophobicity characteristics, are well known. 

ork also may be modified to create ork derivatives by forming covalent or 
aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids, 

10 phosphate, acetyl groups and the like. Covalent derivatives of ork may be prepared by 
linking the chemical moieties to functional groups on ork amino acid side chains or at 
the N-terminus or C-terminus of an ork polypeptide or the extracellular domain 
thereof. Other derivatives of ork within the scope of this invention include covalent or 
aggregative conjugates of ork or its fragments with other proteins or polypeptides, 

15 such as by synthesis in recombinant culture as N-terminal or C-terminal fusions. For 
example, the conjugate may comprise a heterologous signal or leader polypeptide 
sequence at the N-terminus of an ork polypeptide. Examples of such signal peptides 
are the a-factor leader of Saccharomyces; the signal sequence for interleukin-7 (IL-7) 
described in United States Patent 4,965,195; or the signal sequence for interleukin-2 

20 receptor described in United States Patent Application 06/626,667 filed on July 2, 
1984. The signal or leader peptide co-translationally or post-translationally directs 
transfer of the conjugate from its site of synthesis to a site outside of the cell membrane 
or cell wall. 

ork polypeptide fusions can comprise peptides added to facilitate purification 
25 and identification of ork . Such peptides include, for example, poly-His or the 

antigenic identification peptides described in U.S. Patent No. 5,011,912 and in Hopp 
et al., Bio/Technology 6:1204, 1988. One such peptide is the FLAG® peptide, Asp- 
Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK), which is highly antigenic and 
provides an epitope reversibly bound by a specific monoclonal antibody enabling rapid 
30 assay and facile purification of expressed recombinant protein. This sequence is also 
specifically cleaved by bovine mucosal enterokinase at the residue immediately 
following the Asp-Lys pairing. Fusion proteins capped with this peptide may also be 
resistant to intracellular degradation in E. coli. A murine hybridoma designated 4E1 1 
produces a monoclonal antibody that binds the peptide DYKDDDDK in the presence of 
35 certain divalent metal cations (as described in U.S. Patent 5,01 1,912) and has been 
deposited with the American Type Culture Collection under accession no HB 9259. 

The present invention further includes ork polypeptides with or without 
associated native-pattern glycosylation. ork expressed in yeast or mammalian 
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expression systems (e.g., COS-7 cells) may be similar to or significantly different 
from a native ork polypeptide in molecular weight and glycosylation pattern, 
depending upon the choice of expression system. Expression of ork polypeptides in 
bacterial expression systems, such as E. coli, provides non-glycosylated molecules. 
5 DNA constructs that encode various additions or substitutions of amino acid 

residues or sequences, or deletions of terminal or internal residues or sequences not 
needed for biological activity or binding, can be prepared. For example, N- 
glycosylation sites in the ork extracellular domain can be modified to preclude 
glycosylation while allowing expression of a homogeneous, reduced carbohydrate 

10 analog using yeast expression systems. N-glycosylation sites in eukaryotic 

polypeptides are characterized by an amino acid triplet Asn-X-Y, wherein X is any 
amino acid except Pro and Y is Ser or Thr. Appropriate modifications to the nucleotide 
sequence encoding this triplet will result in substitutions, additions or deletions that 
prevent attachment of carbohydrate residues at the Asn side chain. Known procedures 

15 for inactivating N-glycosylation sites in proteins include those described in U.S. Patent 
5,071,972 and EP 276,846. In another example, sequences encoding Cys residues 
that are not essential for biological activity can be altered to cause the Cys residues to be 
deleted or replaced with other amino acids, preventing formation of incorrect 
intramolecular disulfide bridges upon renaturation. Other variants are prepared by 

20 modification of adjacent dibasic amino acid residues to enhance expression in yeast 

systems in which KEX2 protease activity is present. EP 212,914 discloses the use of 
site-specific mutagenesis to inactivate KEX2 protease processing sites in a protein. 

Naturally occurring ork variants are also encompassed by the present 
invention. Examples of such variants are proteins that result from alternative mRNA 

25 splicing events (since ork is encoded by a multi-exon gene) or from proteolytic 

cleavage of the ork protein, provided the desired biological activity (e.g., binding to an 
■ 2iX\t\-ork antibody or to the ligand) is retained. Alternative splicing of mRNA may yield 
a truncated but biologically active ork protein, such as a naturally occurring soluble 
form of the protein, for example. An alternative splicing event in tie, an RTK of the 

30 same subgroup as ork, yielded a protein lacking the first of three fibronectin type III 
repeats. Variations attributable to proteolysis include, for example, differences in the 
N- or C-termini upon expression in different types of host cells, due to proteolytic 
removal of one or more terminal amino acids (which may occur intracellular^ or during 
purification). Varying N-termini may also result from cleavage of the signal peptide in 

35 certain host cells at a point other than between amino acids 18 and 19 of the disclosed 
sequence. 

In certain host cells, post-translatronal processing will remove the methionine 
residue encoded by an initiation codon, whereas the methionine residue will remain at 
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the N-terminus of proteins produced in other host cells. The N-terminal amino acid 
may, for example, be any of the amino acids at positions 1 to 5 of SEQ ID NO:l (for 
proteins comprising a signal peptide) or 19-23 (for the mature protein). The C- 
terminus may be truncated deliberately during expression vector construction (e.g., in 
5 constructing vectors encoding soluble proteins as described above) or as a result of 
differential processing which may remove up to about five C-terminal amino acids, for 
example. 

Due to the known degeneracy of the genetic code wherein more than one codon 
can encode the same amino acid, a DNA sequence may vary from that presented in SEQ 
10 ID NO:l , and still encode an ork protein having the amino acid sequence of SEQ ID 
NO:l. Such variant DNA sequences may result from silent mutations (e.g., occurring 
during PCR amplification), and may be the product of deliberate mutagenesis of a 
native sequence. 

Nucleic acid sequences within the scope of the present invention include 

15 isolated DNA and RNA sequences that hybridize to the ork nucleotide sequences 

disclosed herein under conditions of moderate or severe stringency, and which encode 
biologically active ork. Moderate stringency hybridization conditions refer to 
conditions described in, for example, Sambrook et al. Molecular Cloning: A 
Laboratory Manual, 2 ed. Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory 

20 Press, (1989). Conditions of moderate stringency, as defined by Sambrook et aL, 

include use of a prewashing solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0) 
and hybridization conditions of about 55°C, 5 X SSC, overnight. Conditions of severe 
stringency include higher temperatures of hybridization and washing. The skilled 
artisan will recognize that the temperature and wash solution salt concentration may be 

25 adjusted as necessary according to factors such as the length of the probe. 

The present invention thus provides isolated DNA sequences encoding 
biologically active ork , selected from: (a) DNA derived from the coding region of a 
native mammalian ork gene (e.g., cDNA derived from the coding region of the human 
ork cDNA sequence presented in SEQ ID NO:l); (b) DNA capable of hybridizing 

30 under moderately stringent conditions to a DNA derived from the extracellular region of 
the sequence presented in SEQ ID NO:l, and which encodes biologically active ork ; 
and (c) DNA which is degenerate as a result of the genetic code to a DNA defined in 
(a) or (b) and which encodes biologically active ork, 

ork polypeptides in the form of oligomers such as dimers or trimers are within 

35 the scope of the present invention. Oligomers may be linked by disulfide bonds formed 
between cysteine residues on different ork polypeptides. In one embodiment of the 
invention, an ork dimer is created by fusing ork to the Fc region of an antibody (IgGl). 
The Fc polypeptide preferably is fused to the C-terminus of a soluble ork (comprising 
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only the extracellular domain). A gene fusion encoding the ork fusion protein is 
inserted into an appropriate expression vector. The ork fusion proteins are allowed to 
assenble much like antibody molecules, whereupon interchain disulfide bonds form 
between Fc polypeptides, yielding divalent ork. If fusion proteins are made with both 
5 heavy and light chains of an antibody, it is possible to form an ork oligomer with as 
many as four ork extracellular regions. Alternatively, one can link two soluble ork 
domains with a peptide linker such as the Gly4SerGly5Ser linker sequence described in 
United States Patent 5,073,627. A fusion protein comprising two or more ork 
polypeptides (with or without peptide spacers) may be produced by recombinant DNA 
10 technology. 

The present invention further provides antisense or sense oligonucleotides 
comprising a single-stranded nucleic acid sequence (either RNA or DNA) capable of 
binding to target ork mRNA (sense) or ork DNA (antisense) sequences. Antisense or 
sense oligonucleotides, according to the present invention, comprise a fragment of the 

15 coding region of ork cDNA. Such a fragment generally comprises at least about 14 
nucleotides, preferably from about 14 to about 30 nucleotides. The ability to create an 
antisense or a sense oligonucleotide, based upon a cDNA sequence for a given protein 
is described in, for example, Stein and Cohen, Cancer Res. 48:2659, 1988 and van der 
Krol et al., BioTechniques 6:958, 1988. 

20 Binding of antisense or sense oligonucleotides to target nucleic acid sequences 

results in the formation of duplexes that block translation (RNA) or transcription 
(DNA) by one of several means, including enhanced degradation of the duplexes, 
premature termination of transcription or translation, or by other means. The antisense 
oligonucleotides thus may be used to block expression of ork proteins. Antisense or 

25 sense oligonucleotides further comprise oligonucleotides having modified sugar- 
phosphodiester backbones (or other sugar linkages, such as those described in 
WO91/06629) and wherein such sugar linkages are resistant to endogenous nucleases. 
Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of 
resisting enzymatic degradation) but retain sequence specificity to be able to bind to 

30 target nucleotide sequences. Other examples of sense or antisense oligonucleotides 

include those oligonucleotides which are covalently linked to organic moieties, such as 
those described in WO 90/10448, and other moieties that increase affinity of the 
oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further 
still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes 

35 may be attached to sense or antisense oligonucleotides to modify binding specificities of 
the antisense or sense oliginucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the 
target nucleic acid sequence by any gene transfer method, including, for example, 
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CaP04-mediated DNA transfection, electroporarion, or other gene transfer vectors such 
as Epstein-Barr virus. Antisense or sense oligonucleotides are preferably introduced 
into a cell containing the target nucleic acid sequence by insertion of the antisense or 
sense oligonucleotide into a suitable retroviral vector, then contacting the cell with the 
5 retrovirus vector containing the inserted sequence, either in vivo or ex vivo. Suitable 
retroviral vectors include, but are not limited to, the murine retrovirus M-MuLV, N2 (a 
retrovirus derived from M-MuLV), or or the double copy vectors designated DCT5A, 
DCT5B and DCT5C (see PCT Application US 90/02656). Alternatively, other 
promotor sequences may be used to express the oligonucleotide. 

10 Sense or antisense oligonucleotides may also be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand 
binding molecule, as described in WO 91/04753. Suitable ligand binding molecules 
include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. 

15 Alternatively, a sense or an antisense oligonucleotide may be introduced into a 

cell containing the target nucleic acid sequence by formation of an oligonucleotide-lipid 
complex, as described in WO 90/10448. The sense or antisense oligonucleotide-lipid 
complex is preferably dissociated within the cell by an endogenous lipase. 
Uses of Ork Proteins 

20 One use of ork is as a research tool for identifying the ligand that binds thereto 

and studying the biological effects of ligand binding. The ork polypeptides of the 
present invention also may be employed in in vitro assays for detection of ork or its 
ligand or the interactions thereof. 

The ork polypeptides of the present invention can be used in a binding assay to 

25 detect cells expressing a ligand for ork. For example, ork or an extracellular domain or 
a fragment thereof can be conjugated to a detectable moiety such as 1^1. 
Radiolabeling with 125] can 5 e performed by any of several standard methodologies 
that yield a functional molecule labeled to high specific activity. Alternatively, 

another detectable moiety such as an enzyme that can catalyze a colorometric or 

30 fluorometric reaction, biotin or avidin may be used. Cells to be tested for ork ligand 
expression can be contacted with the labeled ork. After incubation, unbound labeled 
ork is removed and binding is measured using the detectable moiety. 

Soluble ork polypeptides may be employed to competitively bind the ligand in 
vivo, thus inhibiting signal transduction activity via endogenous cell surface bound ork 

35 proteins. Further, soluble proteins are generally more suitable for intravenous 
administration and may exert their desired effect (e.g. binding a ligand) in the 
bloodstream. Soluble ork proteins comprising only the extracellular (ligand-bihding) 
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domain lack the tyrosine kinase domain which is located within the cytoplasmic 
domain. 

The ork polypeptides disclosed herein are also useful in generating antibodies 
specific to the ork immunogens. Monoclonal antibodies specific for ork or antigenic 
5 fragments thereof may be prepared by procedures that include those described in 

Example 4. The monoclonal antibodies can be attached to insoluble support materials 
for use in immunoaffinity column purification of ork proteins. Other uses for the 
antibodies include detecting ork proteins in in vitro assays and identifying and purifying 
additional ork polypeptides such as variants (e.g., from alternative splicing events) that 
10 comprise the region from which the immunogen was derived. 



EXAMPLE 1 : PCR-Based Cloning of Human ork cDNA 
Degenerate oligonucleotide primers were synthesized corresponding to the 
conserved sequences HRDLAA (TK-1; sense orientation) and SDVWS (TK-2; 
15 antisense orientation) contained within the kinase domain of all RTKs (Hanks, et aL, 
1988). The oligonucleotides corresponding to the conserved motifs HRDLAA and 
SDVWS were 

(S'-CACC/AAGG/AGAC/jC/yTGGCA/TGC-S') and 

(S'-aG^GACCAA/T/^qacG^tcG/aCT-S'), respectively. Both oligonucleotides 

20 were 32-fold degenerate and contained recognition sequences for Xho I. The 

HRDLAA sequence was chosen to favor RTKs over .srofamily tyrosine kinases that 
have the sequence HRDLRA (Hanks et aL, 1988). 

Single-stranded cDNA was synthesized from human placental polyadenylated 
RNA by standard methods. The single stranded cDNA was used as template for a PCR 

25 reaction using the conditions of Wilks (1989). The two degenerate oligonucleotides 
were used as primers in the PCR reaction. Amplified PCR reaction products of about 
200 base pairs were digested with Xho I and ligated into an Xho I-digested plasmid 
vector desinated pBLUESCRIPT® SK. This vector, available from Stratagene 
Cloning Systems, La Jolla, California, is replicable in E. coli and contains a polylinker 

30 segment that includes 21 unique restriction sites, one of which is Xho I. The ligation 
mixture was transformed into E. coli cells by conventional procedures. 

Recombinant plasmids were recovered from the transformed E. coli cells and 
the nucleotide sequence of the DNA insert in a number of individual clones was 
determined. Among the known RTKs that were represented in the PCR-library were c- 

35 fms, JTK-4 (Partanen et aL Proc. Natl Acad, ScL USA, 87:8913-17, 1990), kdr 

(Terman et aL, Oncogene, 6:1677-1683, 1991), eph (Hirai et aL, 1987 supra), and/7r- 
4 (Aprelikova et aL Cancer Res., 52:746-748, 1992). However, one clone, 
designated HPK-6, contained a novel sequence. The approximately 200bp cDNA 
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insert of HPK-6 was isolated and radiolabeled with 32 P by conventional techniques for 
use as a probe to isolate longer cDNA sequences encoding the novel protein. 

An oligo-dT-primed human placental cDNA library in plasmid pDC302 has 
been previously described (Larsen et ai, J. Exp. Me±, 172:1559-1570, 1990). 
5 Briefly, total cell RNA was isolated from whole fresh placental tissue and 

polyadenylated RNA was prepared by chromatography on oligo(dT)-cellulose. 
Double-stranded, oligo(dT)-primed cDNA was prepared with a commercial kit 
(Amersham Corp., Arlington Heights, IL). The resulting cDNA was size fractionated 
by chromatography on Sephacryl S-1000 (Pharmacia Fine Chemicals, Piscataway, 
10 NJ) in 0.5 M sodium acetate. The excluded cDNA was cloned into the BglU site of the 
mammalian expresion vector, pDC302 (described below) by an adaptor method similar 
to that described by Haymerle et aL (Nucl. Acids Res. 14:8615, 1986). Briefly, 
noncomplementary oligonucleotides of the sequence 

S'-GATCTTGGAACGAGACGACCTGCT and S'-AGCAGGTCGTCTCGTTCCAA 

15 synthesized on a DNA synthesizer (model 380A; Applied Biosystems, Foster City, 

CA) were annealed and ligated in separate reactions to either cDNA or Bg/II cut vector. 
Nonligated oligonucleotides were separated from cDNA or vector by chromatography 
over Sepharose CL-2B (Pharmacia Fine Chemicals) at 65°C in lOmM Tris (pH 8.0), 
O.lmM EDTA. 5 ng of adaptored vector was ligated to adaptored cDNA in 10-(il 

20 reactions containing 50 mM sodium chloride, 50 mM Tris-HCl (pH 7.5), 10 mM 
magnesium chloride, 1 mM spermidine, 0.5 mM ATP, 0.1 U/|il T4 polynucleotide 
kinase and 0.4 U/jil T4 DNA ligase for 30 min at 37°C Reactions were then desalted 
by drop dialysis on VSWP 013 filters (Millipore Corp., Bedford, MA) against distilled 
water for 40 min immediately before electroporation into Escherichia coli strain DH5a 

25 Transformants were obtained with an average cDNA insert size of 1.6 kb. 

The pDC302 expression vector employed in preparing this cDNA library has 
been described by Mosley et al. (Cell 59:335, 1989). pDC302 is an expression vector 
for use in mammalian host cells, but also replicates in E. coli. 

pDC302 was assembled from pDC201 (Sims et al., Science 241:585, 1988), 

30 SV40 and cytomegalovirus DNA and comprises, in order with the direction of 

transcription from the origin of replication: (1) SV40 sequences from coordinates 
5171-270 including the origin of replication, enhancer sequences and early and late 
promoters; (2) human cytomegalovirus sequences including the promoter and enhancer 
regions (nucleotides 67 1 to 63 from the sequence published by Boechart et al. {Cell 

35 41:521, 1985); (3) adenovirus-2 sequences containing the first exon and part of the 
intron between the first and second exons of the tripartite leader, the second exon and 
pan of the third exon of the tripartite leader; and (4) a multiple cloning site (MCS) 
containing sites for Xhol, Asp718, Smal, NotI and BglU; (5) SV40 sequences from 
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coordinates 4127-4100 and 2770*2533 that include the polyadenylation and termination 
signals for early transcription; (6) sequences derived from pBR322 and virus- 
associated sequences VAI and VAII of pDC201, with adenovirus sequences 10532- 
1 1 156 containing the VAI and VAII genes, followed by pBR322 sequences from 4363- 
5 2486 and 1094-375 containing the ampicillin resistance gene and origin of replication. 
Miniprep DNA isolated from individual pools of colonies from the above- 
described placental cDNA library was digested with Bgl II (which excises the cDNA 
inserts) and screened for hybridization to the HPK-6-derived probe. A clone 
comprising a 2.3 Kb cDNA insert was isolated from one pool (#398) and the insert was 

10 radiolabeled for use in subsequent library screens. 

A random-primed cDNA library in A.gtl0 was generated frompoly(A) + human 
placental RNA and screened for hybridization to the 2.3 kb cDNA probe. One clone 
(32-1), about 4.0 Kb in length, contained an entire coding region, as well as 5' and 3' 
untranslated sequences. 

15 A DNA sequence for human ork is presented in SEQ ID NO:l, along with the 

amino acid sequence encoded thereby. The SEQ ID NO:l sequence was derived by 
combining DNA sequencing information from the fully-sequenced 2.3kb clone with 
that obtained for the 4.0 kb clone (for which certain portions overlapping with the 2.3 
kb clone were not fully sequenced). Beginning with an ATG codon at nucleotides 149- 

20 151 there is an open reading frame extending for 1 124 amino acids. Several lines of 
evidence suggest that this methionine codon is the initiating codon. The ATG codon is 
in a proper context (Kozak, 1984), and is followed by a hydrophobic stretch of amino 
acids that resembles a leader sequence (amino acids 1-18 of SEQ ID NO:l, i.e., 
methionine through glycine). There are two in-frame termination codons upstream of 

25 the ATG codon. The protein comprises an N- terminal extracellular (ligand-binding) 
domain (amino acids 19-745), followed by a transmembrane region comprising amino 
acids 746-772, and a C-terminal cytoplasmic domain (which contains the tyrosine 
kinase domain responsible for the tyrosine phosphorylating activity of the protein) 
comprising amino acids 773-1 124. 

30 The 2.3 kb clone encodes a C-terminal fragment of ork , including a small C- 

terminal portion of the extracellular domain, followed by the complete transmembrane 
and cytoplasmic domains of the protein (amino acids 698-1 124). The 2.3 kb ork 
cDNA extends from nucleotide 2240 to the 3* end of the SEQ ED NO:l DNA sequence 
and contains additional 3 1 non-coding sequences (i.e., the 2.3 kb clone has a longer 3* 

35 non-coding region than does the 4.0 kb clone depicted in SEQ ID NO:l). Since the 2.3 
kb clone lacks most of the extracellular domain, the tyrosine kinase encoded thereby is 
not expected to bind a ligand. 
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The 4.0 kb ork cDNA in plasmid pBLUESCRIPT®SK in E. coli strain DH5a 
was deposited with the American Type Culture collection on May 28, 1992 and was 
given accession number ATCC 69003. The strain deposit was made under the terms of 
the Budapest Treaty. 

5 

EXAMPLE 2: Characterization of ork Gene Product 
Rabbit polyclonal antiserum was generated against a peptide consisting of the 
carboxyl-terminal 21 amino acids of the ork protein cytoplasmic domain (SEQ ID NO:l 
sequence) conjugated to ovalbumin (Kitagawa and Aikawa,/. B/<?c/*e/n.79:233-236, 

10 1976). Animals were immunized with 100 (ig of the conjugate in complete Freund's 
adjuvant, followed by three boosts in incomplete adjuvant, after which time test bleeds 
were taken and assayed for immunoreactivity towards the peptide immobilized on 
nitrocellulose. Bleeds that showed anti-peptide activity were used in the following 
immunoprecipitation experiment. 

15 To examine the protein encoded by the ork cDNA, the full length cDNA was 

excised from XgtlO by Not I digestion and ligated into a A/of I-digested expression 
plasmid pDC302 (described above). Not I cleaves pDC302 at a unique site within a 
multiple cloning site. The resulting recombinant vectors were transfected into COS-7 
cells and 3 days later the cells were labelled with 35s-met/cys and detergent solubilized. 

20 The 35 S-labeled lysates were incubated with 5 |il of the anti-peptide serum for 1 hour at 
4°C, after which time 100 \x\ of pre-swollen proteinA-sepharose was added and the 
lysates were incubated for an additional hour at 4°C The immune complexes were 
washed 3 times with PBS/1% Triton X-100, boiled in SDS-PAGE sample buffer, and 
analyzed by SDS/polyacrylamide gel electrophoresis. The gel was treated with Amplify 

25 (Amersham, Arlington Heights, IL) and exposed to X-Ray film. 

As shown in Figure 3 (panel A), the immune serum, but not the pre-immune 
serum, precipitated an approximately 140 Kd protein in the lysates from the ork-. 
transfected cells, but not in lysates from control cells. Preincubation of the immune 
serum with the peptide specifically blocked the precipitation of the 140 Kd protein. 

30 A hallmark of protein tyrosine kinases is their ability to auto-phosphorylate in 

vitro in immune complexes (Ullrich and Schlessinger, 1990)). We tested the ability of 
the ork protein to autophosphorylate using the rabbit serum described above. Ork- or 
mock-transfected COS-7 cells were washed IX in cold PBS/0. ImM sodium 
orthovanadate and lysed at 10^ cells/ml in a buffer containing: 25mM TRIS pH 

35 8/150mM NaO/lmM EGTA/lmM DTT/1% NP-40/0.1mM sodium 

orthovanadate/lmM PMSF/10 |ig/ml leupeptin/10 |ig/ml pepstatin A. Lysates were 
incubated with 5 (il of the above-described rabbit anti-peptide serum or normal rabbit 
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serum for 2 hours at 4°C with 15 fil (packed volume) protein A-sepharose. Immune 
complexes were washed 3X with 25mM TRIS pH 8/150mM NaCl/1% Triton X- 
100/0.1% SDS/1% sodium deoxychoiate/lmM DTT/O.lmM sodium 
orthovanadate/lmM PMSF/ 10 |ig/ml leupeptin, followed by three washes in 20mM 
5 HEPES pH 7.4/1 OmM MnCl2/5mM MgCl2- In vitro phosphorylation reactions were 
initiated by suspending the beads to a final volume of 50 fil in a mixture containing [y- 
32p]ATP (25mM)/10mM MnCl2/5mM MgCl2/20mM HEPES pH 7 A After 
incubation for 30 minutes at 37°C the beads were washed with 20mM HEPES pH 
7.4/10mM ATP/5mM EGTA, boiled in SDS-PAGE sample buffer and separated on 8- 

10 16% SDS/polyacrylamide gels followed by autoradiography. For phosphoaminoacid 
analysis, separated proteins were electrophoretically transferred to membranes and 
regions containing 32p.i a 5 e |i ec j pro tein were excised and incubated with 6N HC1 for 1 
hour at 1 10°C. Hydrolysates were analysed by two dimensional thin-layer 
electrophoresis (Cooper et aL, 1983) and autoradiography. 

15 As shown in Figure 3, panel B, immunoprecipitated ork was capable of auto- 

phosphorylation. Phospho-amino acid analysis demonstrated that only phospho- 
tyrosine was present in the immunoprecipitates (Fig. 3, panel C). We also examined 
whether the ork expressed in the COS cells was phosphorylated on tyrosine residues. 
Lysates from ork- and control- transfected cells were immunoprecipated with the PI 

20 (polyclonal) serum, separated by SDS-PAGE, transferred to filters and incubated with 
an anti-phosphotyrosine anti-serum. As shown in Figure 3, panel D, a 140 Kd protein 
containing phosphotyrosine was seen only in the lysates from #r£-transfected cells that 
had been immunoprecipitated with the anti-or& serum. It is unclear from these data 
whether ork has a high level of intrinsic tyrosine kinase activity, whether the COS cells 

25 express a potential ligand for ork which is interacting with the ork protein on the COS 
cell surface, or whether ork is being phosphorylated by a tyrosine kinase expressed by 
COS cells. 

EXAMPLE 3: Screening Human Tissues for ork Expression 
RNAs from several human tissues were screened for ork transcripts. For ork 
30 expression, a filter containing 2|ag of poly(A) + RNA from various human tissues 
(Clontech, San Diego, CA) was hybridized with an antisense RNA probe containing 
nucleotides 2240-3005 (outside of the kinase domain). As shown in Figure 4, a 4.5 
kb mRNA was found at highest levels in placenta and lung, with lower levels detected 
in kidney and heart. Each of these tissues is highly vascularized, suggesting that ork 
35 may be expressed in endothelial cells. Consistent with this, umbilical cord vein 
endothelial cells (HU VECs) also expressed ork message. In addition to the tissues 
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listed in Figure 4, RNAs isolated from hematopoietic cell lines and tissues were 
surveyed, but ork mRNA was not detected. 

EXAMPLE 4: Monoclonal Antibodies Directed Against ork 
5 This example illustrates the preparation of monoclonal antibodies to ork or 

immunogenic fragments thereof. Purified ork can be used to generate anti-e>r& 
monoclonal antibodies using conventional techniques, for example, those techniques 
described in U.S. Patent 4,41 1,993. 

In one embodiment of the invention, the peptide used to generate rabbit 

10 polyclonal antiserum in example 2 is used to produce a monoclonal antibody. The 

peptide comprises the carboxy-terminal 21 amino acids of the ork protein (i.e., the last 
21 amino acids of the cytoplasmic domain in the SEQ ID NO:l sequence) conjugated to 
ovalbumin. Briefly, mice are immunized with the immunogenic peptide emulsified in 
complete Freund's adjuvant, and injected in amounts ranging from 10-100 (ig 

15 subcutaneously or intraperitoneally. Ten to twelve days later, the immunized animals 
are boosted with additional immunogenic peptide emulsified in incomplete Freund's 
adjuvant. Mice are periodically boosted thereafter on a weekly to bi-weekly 
immunization schedule. Serum samples are periodically taken by retro-orbital bleeding 
or tail-tip excision for testing by dot blot assay or ELISA (Enzyme-Linked 

20 Immunosorbent Assay), for anti-c?r& antibodies. 

Following detection of an appropriate antibody titer, positive animals are 
provided one last intravenous injection of the immunogen in saline. Three to four days 
later, the animals are sacrificed, spleen cells harvested, and spleen cells are fused to a 
murine myeloma cell line (e.g., NS 1 or Ag 8.653). Fusions generate hybridoma cells, 

25 which are plated in multiple microtiter plates in a HAT (hypoxanthine, aminopterin and 
thymidine) selective medium to inhibit proliferation of non-fused cells, myeloma 
hybrids, and spleen cell hybrids. 

The hybridoma cells are screened by ELISA for reactivity against purified ork 
peptide by adaptations of the techniques disclosed in Engvall et al., Immunochem. 

30 8:871, 1971 and in U.S. Patent 4,703,004. Positive hybridoma cells can be injected 
intraperitoneally into syngeneic BALB/c mice to produce ascites containing high 
concentrations of anti-orfc monoclonal antibodies. Alternatively, hybridoma cells can 
be grown in vitro in flasks or roller bottles by various techniques. Monoclonal 
antibodies produced in mouse ascites can be purified by ammonium sulfate 

35 precipitation, followed by gel exclusion chromatography. Alternatively, affinity 

chromatography based upon binding of antibody to protein A or protein G can also be 
used, as can affinity chromatography based upon binding to ork. 
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EXAMPLE 5: Cloning of Murine ork cDNA 
Murine ork cDNA was isolated by cross-species hybridization using human ork 
cDNA as a probe. A murine cDNA library was prepared from 12 1/2 day old mouse 
embryos. The library was screened by conventional techniques using the radiolabeled 
5 2.3 Kb human ork cDNA described in example 1 as a probe. 

The DNA sequence of the cDNA insert of a hybridizing clone was determined, 
and the amino acid sequence encoded thereby is presented in SEQ ID NO:3 and in 
Figure 6. The murine ork sequence (designated "m ,r ; in the top rows) is aligned with 
the corresponding portion of human ork ("h") in Figure 6. Identical amino acids are 
10 indicated with a line; conservative or similar amino acid changes are indicated with two 
dots or one dot, respectively. The percent similarity of these murine and human ork 
amino acid sequences is 99.2% and the percent identity is 97.5%, as determined by the 
GAP computer program. 

The murine ork sequence corresponds to amino acids 886-1 124 of human ork, 
15 a region falling within the cytoplasmic domain and extending to the C-terminus of the 
human ork protein. The murine cDNA may be used as a probe to screen the same or a 
different murine cDNA library to identify a full length murine ork clone. Alternatively, 
given the similarity of the human and murine sequences, murine libraries may be 
screened using a probe derived from the extracellular region of human ork. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

5 

<i) APPLICANT: Ziegler, Steven F. 

(ii) TITLE OF INVENTION: NOVEL TYROSINE KINASE 

10 (iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

{A) ADDRESSEE: Immunex Corporation 

(B) STREET: 51 University Street 

15 (C) CITY: Seattle 

(D ) STATE: Washington 

<E) COUNTRY: US 

<F) ZIP: 93101 

20 (v) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



25 



30 



35 



40 



<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/905,600 

(B) FILING DATE: 26-JUN-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Seese, Kathryn A. 

(B) REGISTRATION NUMBER: 32,172 

(C) REFERENCE /DOCKET NUMBER: 2609 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 587-0430 

(B) TELEFAX: (206) 233-0644 

(C) TELEX: 756822 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 4138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA to mRNA 



<i>:) FEATURE: 

(A) NAME /KEY: CDS 
55 (B) LOCATION: 149.. 3523 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG ACGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 120 



60 



BNSDOCID <WO 94G0469A1> 



WO 94/00469 



PCT/US93/06093 



29 



GAAACTGGAT GGAGAGATTT GGGGAAGC ATG GAC TCT TTA GCC AGC TTA GTT 172 

Met Asp Ser Leu Ala Ser Leu Val 
1 5 

5 

CTC TGT GGA GTC AGC TTG CTC CTT TCT GGA ACT GTG GAA GGT GCC ATG 220 
Leu Cys Gly Val Ser Leu Leu Leu Ser Gly Thr Val Glu Gly Ala Met 
10 15 20 

10 GAC TTG ATC TTG ATC AAT TCC CTA CCT CTT GTA TCT GAT GCT GAA ACA 268 
Asp Leu lie Leu lie Asn Ser Leu Pro Leu Val Ser Asp Ala Glu Thr 
25 30 35 40 

TCT CTC ACC TGC ATT GCC TCT GGG TGG CGC CCC CAT GAG CCC ATC ACC 316 
15 Ser Leu Thr Cys lie Ala Ser Gly Trp Arg Pro His Glu Pro lie Thr 

45 50 55 

ATA GGA AGG GAC TTT GAA GCC TTA ATG AAC CAG CAC CAG GAT CCG CTG 364 
lie Gly Arg Asp Phe Glu Ala Leu Met Asn Gin His Gin Asp Pro Leu 
20 60 65 70 

GAA GTT ACT CAA GAT GTG ACC AGA GAA TGG GCT AAA AAA GTT GTT TGG 412 

Glu Val Thr Gin Asp Val Thr Arg Glu Trp Ala Lys Lys Val Val Trp 

75 80 85 

25 

AAG AGA GAA AAG GCT AGT AAG ATC AAT GGT GCT TAT TTC TGT GAA GGG 460 

Lys Arg Glu Lys Ala Ser Lys lie Asn Gly Ala Tyr Phe Cys Glu Gly 
90 95 100 

30 CGA GTT CGA GGA GAG GCA ATC AGG ATA CGA ACC ATG AAG ATG CGT CAA 508 
Arg Val Arg Gly Glu Ala lie Arg lie Arg Thr Met Lys Met Arg Gin 
105 110 115 120 

CAA GCT TCC TTC CTA CCA GCT ACT TTA ACT ATG ACT GTG GAC AAG GGA 556 
35 Gin Ala Ser Phe Leu Pro Ala Thr Leu Thr Met Thr Val Asp Lys Gly 

125 130 135 

GAT AAC GTG AAC ATA TCT TTC AAA AAG GTA TTG ATT AAA GAA GAA GAT 604 
Asp Asn Val Asn lie Ser Phe Lys Lys Val Leu lie Lys Glu Glu Asp 
40 140 145 " 150 

GCA GTG ATT TAC AAA AAT GGT TCC TTC ATC CAT TCA GTG CCC CGG CAT 652 
Ala Val lie Tyr Lys Asn Gly Ser Phe lie His Ser Val Pro Arg His 
155 160 165 



45 



GAA GTA CCT GAT ATT CTA GAA GTA CAC CTG CCT CAT GCT CAG CCC CAG 700 
Glu Val Pro Asp lie Leu Glu Val His Leu Pro His Ala Gin Pro Gin 
170 175 180 



50 GAT GCT GGA GTG TAC TCG GCC AGG TAT ATA GGA GGA AAC CTC TTC ACC 7 48 

Asp Ala Gly Val Tyr Ser Ala Arg Tyr lie Gly Gly Asn Leu Phe Thr 

185 190 195 200 

TCG GCC TTC ACC AGG CTG ATA GTC CGG AGA TGT GAA GCC CAG AAG TGG 7 96 

55 Ser Ala Phe Thr Arg Leu He Val Arg Arg Cys Glu Ala Gin Lys Trp 

205 210 215 

GGA CCT GAA TGC AAC CAT CTC TGT ACT GCT TGT ATG AAC AAT GGT GTC .844 

Gly Pro Glu Cys Asn His Leu Cys Thr Ala Cys Met Asn Asn Gly Val 

60 220 225 230 
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TGC CAT GAA GAT ACT GGA GAA TGC ATT TGC CCT CCT GGG TTT ATG GGA 8 92 

Cys His Glu Asp Thr Gly Glu Cys lie Cys Pro Pro Gly Phe Met Gly 

235 240 245 

5 AGG ACG TGT GAG AAG GCT TGT GAA CTG CAC ACG TTT GGC AGA ACT TGT 940 

Arg Thr Cys Glu Lys Ala Cys Glu Leu His Thr Phe Gly Arg Thr Cys 

250 255 260 

AAA GAA AGG TGC AGT GGA CAA GAG GGA TGC AAG TCT TAT GTG TTC TGT 988 

10 Lys Glu Arg Cys Ser Gly Gin Glu Gly Cys Lys Ser Tyr Val Phe Cys 

265 270 275 280 

CTC CCT GAC CCC TAT GGG TGT TCC TGT GCC AC A GGC TGG AAG GGT CTG 103 6 

Leu Pro Asp Pro Tyr Gly Cys Ser Cys Ala Thr Gly Trp Lys Gly Leu 
15 285 290 295 

CAG TGC AAT GAA GCA TGC CAC CCT GGT TTT TAC GGG CCA GAT TGT AAG 1084 

Gin Cys Asn Glu Ala Cys His Pro Gly Phe Tyr Gly Pro Asp Cys Lys 

300 305 310 



20 



CTT AGG TGC AGC TGC AAC AAT GGG GAG ATG TGT GAT CGC TTC CAA GGA 1132 

Leu Arg Cys Ser Cys Asn Asn Gly Glu Met Cys Asp Arg Phe Gin Gly 
315 320 325 

25 TGT CTC TGC TCT CCA GGA TGG CAG GGG CTC CAG TGT GAG AGA GAA GGC 1180 

Cys Leu Cys Ser Pro Gly Trp Gin Gly Leu Gin Cys Glu Arg Glu Gly 
330 335 340 

ATA CCG AGG ATG ACC CCA AAG ATA GTG GAT TTG CCA GAT CAT ATA GAA 1228 

30 lie Pro Arg Met Thr Pro Lys lie Val Asp Leu Pro Asp His lie Glu 
345 350 355 360 

GTA AAC AGT GGT AAA TTT AAT CCC ATT TGC AAA GCT TCT GGC TGG CCG 127 6 

Val Asn Ser Gly Lys Phe Asn Pro He Cys Lys Ala Ser Gly Trp Pro 

35 365 370 375 

CTA CCT ACT AAT GAA GAA ATG ACC CTG GTG AAG CCG GAT GGG ACA GTG 132 4 

Leu Pro Thr Asn Glu Glu Met Thr Leu Val Lys Pro Asp Gly Thr Val 
380 385 390 

40 

CTC CAT CCA AAA GAC TTT AAC CAT ACG GAT CAT TTC TCA GTA GCC ATA 1372 

Leu His Pro Lys Asp Phe Asn His Thr Asp His Phe Ser Val Ala He 
395 400 405 

45 TTC ACC ATC CAC CGG ATC CTC CCC CCT GAC TCA GGA GTT TGG GTC TGC 1420 

Phe Thr He His Arg He Leu Pro Pro Asp Ser Gly Val Trp Val Cys 
410 415 420 

AGT GTG AAC ACA GTG GCT GGG ATG GTG GAA AAG CCC TTC AAC ATT TCT 14 68 

50 Ser Val Asn Thr Val Ala Gly Met Val Glu Lys Pro Phe Asn lie Ser 
425 430 435 440 

GTT AAA GTT CTT CCA AAG CCC CTG AAT GCC CCA AAC GTG ATT GAC ACT 1516 

Val Lys Val Leu Pro Lys Pro Leu Asn Ala Pro Asn Val He Asp Thr 

55 445 450 455 

GGA CAT AAC TTT GCT GTC ATC AAC ATC AGC TCT GAG CCT TAC TTT GGG 1564 

Gly His Asn Phe Ala Val lie Asn lie Ser Ser Glu Pro Tyr Phe Gly 
460 465 470 

60 
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GAT GGA CCA ATC AAA TCC AAG AAG CTT CTA TAC AAA CCC GTT AAT CAC 1612 

Asp Gly Pro lie Lys Ser Lys Lys Leu Leu Tyr Lys Pro Val Asn His 
475 480 485 

5 TAT GAG GCT TGG CAA CAT ATT CAA GTG AC A AAT GAG ATT GTT ACA CTC 1660 

Tyr Glu Ala Trp Gin His lie Gin Val Thr Asn Glu lie Val Thr Leu 

490 495 500 

AAC TAT TTG GAA CCT CGG ACA GAA TAT GAA CTC TGT GTG CAA CTG GTC 17 08 

10 Asn Tyr Leu Glu Pro Arg Thr Glu Tyr Glu Leu Cys Val Gin Leu Val 

505 510 515 520 

CGT CGT GGA GAG GGT GGG GAA GGG CAT CCT GGA CCT GTG AGA CGC TTC 175 6 

Arg Arg Gly Glu Gly Gly Glu Gly His Pro Gly Pro Val Arg Arg Phe 
15 525 530 535 

ACA ACA GCT TCT ATC GGA CTC CCT CCT CCA AGA GGT CTA AAT CTC CTG 18 04 

Thr Thr Ala Ser He Gly Leu Pro Pro Pro Arg Gly Leu Asn Leu Leu 
540 545 550 

20 

CCT AAA AGT CAG ACC ACT CTA AAT TTG ACC TGG CAA CCA ATA TTT CCA 1852 

Pro Lys Ser Gin Thr Thr Leu Asn Leu Thr Trp Gin Pro He Phe Pro 
555 560 565 

25 AGC TCG GAA GAT GAC TTT TAT GTT GAA GTG GAG AGA AGG TCT GTG CAA 1900 

Ser Ser Glu Asp Asp Phe Tyr Val Glu Val Glu Arg Arg Ser Val Gin 

570 575 580 

AAA AGT GAT CAG CAG AAT ATT AAA GTT CCA GGC AAC TTG ACT TCG GTG 194 8 

30 Lys Ser Asp Gin Gin Asn He Lys Val Pro Gly Asn Leu Thr Ser Val 

585 590 595 600 

CTA CTT AAC AAC TTA CAT CCC AGG GAG CAG TAC GTG GTC CGA GCT AGA 1996 

Leu Leu Asn Asn Leu His Pro Arg Glu Gin Tyr Val Val Arg Ala Arg 
35 605 610 615 

GTC AAC ACC AAG GCC CAG GGG GAA TGG AGT GAA GAT CTC ACT GCT TGG 204 4 

Val Asn Thr Lys Ala Gin Gly Glu Trp Ser Glu Asp Leu Thr Ala Trp 
620 625 630 

40 

ACC CTT AGT GAC ATT CTT CCT CCT CAA CCA GAA AAC ATC AAG ATT TCC 20 92 

Thr Leu Ser Asp He Leu Pro Pro Gin Pro Glu Asn He Lys He Ser 
635 640 645 

45 AAC ATT ACA CAC TCC TCG GCT GTG ATT TCT TGG ACA ATA TTG GAT GGC 214 0 

Asn lie Thr His Ser Ser Ala Val He Ser Trp Thr He Leu Asp Gly 

650 655 660 

TAT TCT ATT TCT TCT ATT ACT ATC CGT TAC AAG GTT CAA GGC AAG AAT 2188 

50 Tyr Ser He Ser Ser lie Thr lie Arg Tyr Lys Val Gin Gly Lys Asn 

665 670 675 680 

GAA GAC CAG CAC GTT GAT GTG AAG ATA AAG AAT GCC ACC ATC ATT CAG 223 6 

Glu Asp Gin His Val Asp Val Lys He Lys Asn Ala Thr He lie Gin 
55 685 690 695 

TAT CAG CTC AAG GGC CTA GAG CCT GAA ACA GCA TAC CAG GTG GAC ATT 22 8 4 

Tyr Gin Leu Lys Gly Leu Glu Pro Glu Thr Ala Tyr Gin Val Asp He 
700 705 710 
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TTT GCA GAG AAC AAC ATA GGG TCA AGC AAC CCA GCC TTT TCT CAT GAA 2332 

Phe Ala Glu Asn Asn He Gly Ser Ser Asn Pro Ala Phe Ser His Glu 
715 720 725 

5 CTG GTG ACC CTC CCA GAA TCT CAA GCA CCA GCG GAC CTC GGA GGG GGG 2380 

Leu Val Thr Leu Pro Glu Ser Gin Ala Pro Ala Asp Leu Gly Gly Gly 
730 735 740 

AAG ATG CTG CTT ATA GCC ATC CTT GGC TCT GCT GGA ATG ACC TGC CTG 2428 

10 Lys Met Leu Leu He Ala He Leu Gly Ser Ala Gly Met Thr Cys Leu 

745 750 755 760 

ACT G71 CTG TTG GCC TTT CTG ATC ATA TTG CAA TTG AAG AGG GCA AAT 2 47 6 

Thr Val Leu Leu Ala Phe Leu He lie Leu Gin Leu Lys Arg Ala Asn 
15 765 770 775 

GTG CAA AGG AG A ATG GCC CAA GCC TTC CAA AAC GTG AGG GAA GAA CCA 2524 

Val Gin Arg Arg Met Ala Gin Ala Phe Gin Asn Val Arg Glu Glu Pro 
780 785 790 

20 

GCT GTG CAG TTC AAC TCA GGG ACT CTG GCC CTA AAC AGG AAG GTC AAA 2572 

Ala Val Gin Phe Asn Ser Gly Thr Leu Ala Leu Asn Arg Lys Val Lys 
795 800 805 

25 AAC AAC CCA GAT CCT AC A ATT TAT CCA GTG CTT GAC TGG AAT GAC ATC 2620 

Asn Asn Pro Asp Pro Thr He Tyr Pro Val Leu Asp Trp Asn Asp He 
810 815 820 

AAA TTT CAA GAT GTG ATT GGG GAG GGC AAT TTT GGC CAA GTT CTT AAG 2 668 

30 Lys Phe Gin Asp Val He Gly Glu Gly Asn Phe Gly Gin Val Leu Lys 

825 830 835 840 

GCG CGC ATC AAG AAG GAT GGG TTA CGG ATG GAT GCT GCC ATC AAA AGA 2716 

Ala Arg lie Lys Lys Asp Gly Leu Arg Met Asp Ala Ala He Lys Arg 
35 845 850 855 

ATG AAA GAA TAT GCC TCC AAA GAT GAT CAC AGG GAC TTT GCA GGA GAA 27 64 

Met Lys Glu Tyr Ala Ser Lys Asp Asp His Arg Asp Phe Ala Gly Glu 
860 " 865 870 



40 

CTG GAA GTT CTT TGT AAA CTT GGA CAC CAT CCA AAC ATC ATC AAT CTC 2 812 

Leu Glu Val Leu Cys Lys Leu Gly His His Pro Asn He lie Asn Leu 
875 880 885 

45 TTA GGA GCA TGT GAA CAT CGA GGC TAC TTG TAC CTG GCC ATT GAG TAC 2 8 60 

Leu Gly Ala Cys Glu His Arg Gly Tyr Leu Tyr Leu Ala He Glu Tyr 
890 895 900 

GCG CCC CAT GGA AAC CTT CTG GAC TTC CTT CGC AAG AGC CGT GTG CTG 2 908 

50 Ala Pro His Gly Asn Leu Leu Asp Phe Leu Arg Lys Ser Arg Val Leu 
905 910 915 920 

GAG ACG GAC CCA GCA TTT GCC ATT GCC AAT AGC ACC GCG TCC ACA CTG 2 956 

Glu Thr Asp Pro Ala Phe Ala He Ala Asn Ser Thr Ala Ser Thr Leu 
55 925 930 935 

TCC TCC CAG CAG CTC CTT CAC TTC GCT GCC GAC GTG GCC CGG GGC ATG 300 4 

Ser Ser Gin Gin Leu Leu His Phe Ala Ala Asp Val Ala Arg Gly Met 
940 * 945 950 

60 
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GAC TAC TTG AGC CAA AAA CAG TTT ATC CAC AGG GAT CTG GCT GCC AG A 3 052 

Asp Tyr Leu Ser Gin Lys Gin Phe lie His Arg Asp Leu Ala Ala Arg 
955 960 965 

5 AAC ATT TTA GTT GGT GAA AAC TAT GTG GCA AAA ATA GCA GAT TTT GGA 3100 
Asn He Leu Val Gly Glu Asn Tyr Val Ala Lys He Ala Asp Phe Gly 
970 975 980 

TTG TCC CGA GGT CAA GAG GTG TAC GTG AAA AAG ACA ATG GGA AGG CTC 3148 
10 Leu Ser Arg Gly Gin Glu Val Tyr Val Lys Lys Thr Met Gly Arg Leu 
985 990 995 1000 

CCA GTG CGC TGG ATG GCC ATC GAG TCA CTG AAT TAC AGT GTG TAC ACA 3196 
Pro Val Arg Trp Met Ala He Glu Ser Leu Asn Tyr Ser Val Tyr Thr 
15 1005 1010 1015 

ACC AAC AGT GAT GTA TGG TCC TAT GGT GTG TTA CTA TGG GAG ATT GTT 3244 

Thr Asn Ser Asp Val Trp Ser Tyr Gly Val Leu Leu Trp Glu lie Val 

1020 1025 1030 

20 

AGC TTA GGA GGC ACA CCC TAC TGC GGG ATG ACT TGT GCA GAA CTC TAC 32 92 

Ser Leu Gly Gly Thr Pro Tyr Cys Gly Met Thr Cys Ala Glu Leu Tyr 
1035 1040 1045 

25 GAG AAG CTG CCC CAG GGC TAC AGA CTG GAG AAG CCC CTG AAC TGT GAT 3340 
Glu Lys Leu Pro Gin Gly Tyr Arg Leu Glu Lys Pro Leu Asn Cys Asp 
1050 1055 1060 

GAT GAG GTG TAT GAT CTA ATG AGA CAA TGC TGG CGG GAG AAG CCT TAT 3388 
30 Asp Glu Val Tyr Asp Leu Met Arg Gin Cys Trp Arg Glu Lys Pro Tyr 
1065 1070 1075 1080 

GAG AGG CCA TCA TTT GCC CAG ATA TTG GTG TCC TTA AAC AGA ATG TTA 343 6 

Glu Arg Pro Ser Phe Ala Gin He Leu Val Ser Leu Asn Arg Met Leu 
35 1085 1090 1095 

GAG GAG CGA AAG ACC TAC GTG AAT ACC ACG CTT TAT GAG AAG TTT ACT 34 8 4 

Glu Glu Arg Lys Thr Tyr Val Asn Thr Thr Leu Tyr Glu Lys Phe Thr 
1100 1105 1110 



40 



TAT GCA GGA ATT GAC TGT TCT GCT GAA GAA GCG GCC TAGGACAGAA 3530 
Tyr Ala Gly He Asp Cys Ser Ala Glu Glu Ala Ala 

1115 1120 112 



45 CATCTGTATA CCCTCTGTTT CCCTTTCACT GGCATGGGAG ACCCTTGACA ACTGCTGAGA 359 0 

AAACATGCCT CTGCCAAAGG ATGTGATATA TAAGTGTACA TATGTGCTGG AATTCTAACA 3650 

AGTCATAGGT TAATATTTAA GACACTGAAA AATCTAAGTG ATATAAATCA GATTCTTCTC 3710 

50 

TCTCATTTTA TCCCTCACCT GTAGCATGCC AGTCCCGTTT CATTTAGTCA TGTGACCACT 377 0 

CTGTCTTGTG TTTCCACAGC CTGCAAGTTC AGTCCAGGAT GCTAACATCT AAAAATAGAC 3830 

55 TTAAATCTCA TTGCTTACAA GCCTAAGAAT CTTTAGAGAA GTATACATAA GTTTAGGATA 38 90 

AAATAATGGG ATTTTCTTTT CTTTTCTCTG GTAATATTGA CTTGTATATT TTAAGAAATA 3 950 

ACAGAAAGCC TGGGTGACAT TTGGGAGACA TGTGACATTT ATATATTGAA TTAATATCCC 4 010 

TACATGTATT GCACATTGTA AAAAGTTTTA GTTTTGATGA GTTGTGAGTT TACCTTGTAT 4 07 0 
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ACTGTAGGCA CACTTTGCAC TGATATATCA TGAGTGAATA AATGTCTTGC CTACTCAAAA 4130 
AAAAAAAA 4138 

5 

(2) INFORMATION FOR SEQ ID NO: 2: 

U) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



20 



35 



50 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Ser Leu Ala Ser Leu Val Leu Cys Gly Val Ser Leu Leu Leu 
15 10 15 

Ser Gly Thr Val Glu Gly Ala Met Asp Leu lie Leu lie Asn Ser Leu 
20 25 30 



Pro Leu Val Ser Asp Ala Glu Thr Ser Leu Thr Cys lie Ala Ser Gly 

25 35 40 45 

Trp Arg Pro His Glu Pro lie Thr lie Gly Arg Asp Phe Glu Ala Leu 

50 55 60 

30 Met Asn Gin His Gin Asp Pro Leu Glu Val Thr Gin Asp Val Thr Arg 

65 70 75 80 



Glu Trp Ala Lys Lys Val Val Trp Lys Arg Glu Lys Ala Ser Lys He 

85 * 90 95 

Asn Gly Ala Tyr Phe Cys Glu Gly Arg Val Arg Gly Glu Ala He Arg 

100 105 110 



He Arg Thr Met Lys Met Arg Gin Gin Ala Ser Phe Leu Pro Ala Thr 

40 115 120 125 

Leu Thr Met Thr Val Asp Lys Gly Asp Asn Val Asn He Ser Phe Lys 

130 135 140 

45 Lys Val Leu lie Lys Glu Glu Asp Ala Val He Tyr Lys Asn Gly Ser 

145 150 155 160 



Phe He His Ser Val Pro Arg His Glu Val Pro Asp He Leu Glu Val 

165 170 175 

His Leu Pro His Ala Gin Pro Gin Asp Ala Gly Val Tyr Ser Ala Arg 

180 185 190 



Tyr He Gly Gly Asn Leu Phe Thr Ser Ala Phe Thr Arg Leu He Val 

55 195 200 205 

Arg Arg Cys Glu Ala Gin Lys Trp Gly Pro Glu Cys Asn His Leu Cys 
210 215 220 

60 Thr Ala Cys Met Asn Asn Gly Val Cys His Glu Asp Thr Gly Glu Cys 

225 ~ 230 " 235 240 
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He Cys Pro Pro Gly Phe Met Gly Arg Thr Cys Glu Lys Ala Cys Glu 
245 250 255 

5 Leu His Thr Phe Gly Arg Thr Cys Lys Glu Arg Cys Ser Gly Gin Glu 
260 265 270 

Gly Cys Lys Ser Tyr Val Phe Cys Leu Pro Asp Pro Tyr Gly Cys Ser ■ 
10 25 280 285 

Cys Ala Thr Gly Trp Lys Gly Leu Gin Cys Asn Glu Ala Cys His Pro 
0 295 300 

15 3u5 GlY Pr ° JfP CyS LyS Leu Ar * Ser Cys Asn Asn Gly 

310 315 320 

Glu Met Cys Asp Arg Phe Gin Gly Cys Leu Cys Ser Pro Gly Trp Gin 
325 33 0 335 

20 Gly Leu Gin Cys Glu Arg Glu Gly He Pro Arg Met Thr Pro Lys He 
340 345 350 

Val Asp Leu Pro Asp His lie Glu Val Asn Ser Gly Lys Phe Asn Pro 
25 355 360 365 

He Cys Lys Ala Ser Gly Trp Pro Leu Pro Thr Asn Glu Glu Met Thr 



375 



380 



30 3^5 ASP ™ y Thr Val Leu His Pro L ys Asp Phe Asn His 

385 390 395 400 

Thr Asp His Phe Ser Val Ala He Phe Thr lie His Arg He Leu Pro 
405 410 415 

35 Pro Asp Ser Gly Val Trp Val Cys Ser Val Asn Thr Val Ala Gly Met 
420 425 430 

Val Glu Lys Pro Phe Asn He Ser Val Lys Val Leu Pro Lys Pro Leu 
40 440 445 

Asn Ala Pro Asn Val He Asp Thr Gly His Asn Phe Ala Val He Asn 
qi >V 455 460 

45 «5 Gly ASP GlY Pr ° Ile L ^ s Ser L ys ^s 

470 475 4 8 o 

Leu Leu Tyr Lys Pro Val Asn His Tyr Glu Ala Trp Gin His He Gin 
485 4 90 495 

50 Val Thr Asn Glu He Val Thr Leu Asn Tyr Leu Glu Pro Arg Thr Glu 
500 505 510 

Tyr Glu Leu Cys Val Gin Leu Val Arg Arg Gly Glu Glv Gly Glu Gly 
55 515 520 525 

His Pro Gly Pro Val Arg Arg Phe Thr Thr Ala Ser He Gly Leu Pro 
3JU 535 54 0 

60 54? LSU LSU Pr ° LyS Ser Gln Thr Thr ^u As 



n 



555 560* 



BNSDOCID: <WO 9400469A1> 



WO 94/00469 



PCT/US93/06093 



36 

Leu Thr Trp Gin Pro He Phe Pro Ser Ser Glu Asp Asp Phe Tyr Val 
565 570 575 

Glu Val Glu Arg Arg Ser Val Gin Lys Ser Asp Gin Gin Asn He Lys 
5 580 585 590 

Val Pro Gly Asn Leu Thr Ser Val Leu Leu Asn Asn Leu His Pro Arg 
595 600 605 

10 Glu Gin Tyr Val Val Arg Ala Arg Val Asn Thr Lys Ala Gin Gly Glu 
610 " 615 620 



15 



30 



45 



Trp Ser Glu Asp Leu Thr Ala Trp Thr Leu Ser Asp He Leu Pro Pro 
625 630 635 640 

Gin Pro Glu Asn He Lys He Ser Asn He Thr His Ser Ser Ala Val 
645 650 655 



lie Ser Trp Thr lie Leu Asp Gly Tyr Ser lie Ser Ser lie Thr lie 

20 660 665 670 

Arg Tyr Lys Val Gin Gly Lys Asn Glu Asp Gin His Val Asp Val Lys 

675 680 685 

25 He Lys Asn Ala Thr lie lie Gin Tyr Gin Leu Lys Gly Leu Glu Pro 

690 695 700 



Glu Thr Ala Tyr Gin Val Asp lie Phe Ala Glu Asn Asn lie Gly Ser 

705 710 715 720 

Ser Asn Pro Ala Phe Ser His Glu Leu Val Thr Leu Pro Glu Ser Gin 

725 730 735 



Ala Pro Ala Asp Leu Gly Gly Gly Lys Met Leu Leu He Ala lie Leu 

35 740 745 750 

Gly Ser Ala Gly Met Thr Cys Leu Thr Val Leu Leu Ala Phe Leu He 

755 760 765 

40 lie Leu Gin Leu Lys Arg Ala Asn Val Gin Arg Arg Met Ala Gin Ala 

770 775 780 



Phe Gin Asn Val Arg Glu Glu Pro Ala Val Gin Phe Asn Ser Gly Thr 
785 790 795 800 

Leu Ala Leu Asn Arg Lys Val Lys Asn Asn Pro Asp Pro Thr lie Tyr 
805 810 815 



Pro Val Leu Asp Trp Asn Asp lie Lys Phe Gin Asp Val He Gly Glu 
50 -820 825 830 

Gly Asn Phe Gly Gin Val Leu Lys Ala Arg lie Lys Lys Asp Gly Leu 

835 840 845 

55 Arg Met Asp Ala Ala He Lys Arg Met Lys Glu Tyr Ala Ser Lys Asp 

850 855 860 



60 



Asp His -Arg Asp Phe Ala Gly Glu Leu Glu Val Leu Cys Lys Leu Gly 
865 870 875 880 
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His His Pro Asn lie lie Asn Leu Leu Gly Ala Cys Glu His Arg Gly 
885 890 895 

Tyr Leu Tyr Leu Ala lie Glu Tyr Ala Pro His Gly Asn Leu Leu Asp 
5 900 905 910 

Phe Leu Arg Lys Ser Arg Val Leu Glu Thr Asp Pro Ala Phe Ala lie 
915 920 925 

10 Ala Asn Ser Thr Ala Ser Thr Leu Ser Ser Gin Gin Leu Leu His Phe 
930 935 940 



15 



30 



45 



Ala Ala Asp Val Ala Arg Gly Met Asp Tyr Leu Ser Gin Lys Gin Phe 

945 950 955 960 

lie His Arg Asp Leu Ala Ala Arg Asn lie Leu Val Gly Glu Asn Tyr 
965 970 975 



Val Ala Lys lie Ala Asp Phe Gly Leu Ser Arg Gly Gin Glu Val Tyr 
20 980 985 990 

Val Lys Lys Thr Met Gly Arg Leu Pro Val Arg Trp Met Ala lie Glu 
995 1000 1005 

25 Ser Leu Asn Tyr Ser Val Tyr Thr Thr Asn Ser Asp Val Trp Ser Tyr 
1010 1015 1020 



Gly Val Leu Leu Trp Glu lie Val Ser Leu Gly Gly Thr Pro Tyr Cys 
1025 1030 1035 1040 

Gly Met Thr Cys Ala Glu Leu Tyr Glu Lys Leu Pro Gin Gly Tyr Arg 
1045 1050 1055 



Leu Glu Lys Pro Leu Asn Cys Asp Asp Glu Val Tyr Asp Leu Met Arg 
35 1060 1065 1070 

Gin Cys Trp Arg Glu Lys Pro Tyr Glu Arg Pro Ser Phe Ala Gin lie 
1075 1080 1085 

40 Leu Val Ser Leu Asn Arg Met Leu Glu Glu Arg Lys Thr Tyr Val Asn 
1090 1095 1100 



Thr Thr Leu Tyr Glu Lys Phe Thr Tyr Ala Gly lie Asp Cys Ser Ala 
1105 1110 1115 1120 

Glu Glu Ala Ala 



50 (2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Val Pro TrD Leu Gly Ala Cys Glu His Arg Gly Tyr Leu Tyr Leu Ala 
1*5 10 15 

5 

lie Glu Tyr Ala Pro His Gly Asn Leu Leu Asp Phe Leu Arg Lys Ser 
20 25 30 

Arg Val Leu Glu Thr Asp Pro Ala Phe Ala lie Ala Asn Ser Thr Ala 
10 35 40 45 

Ser lie Met Ser Ser Gin Gin Leu Leu His Phe Ala Ala Asp Val Ala 
50 55 60 

15 Arg Gly Met Asp Tyr Leu Ser Gin Lys Gin Phe lie His Arg Asp Leu 

65 70 75 80 



20 



35 



Ala Ala Arg Asn lie Leu Val Gly Glu Asn Tyr lie Ala Lys lie Ala 
85 90 95 

Asp Phe Gly Leu Ser Arg Gly Gin Glu Val Tyr Val Lys Lys Thr Met 
100 105 110 



Gly Arg Leu Pro Val Arg Trp Met Ala lie Glu Ser Leu Asn Tyr Ser 

25 115 120 125 

Val Tyr Thr Thr Asn Ser Asp Val Trp Ser Tyr Gly Val Leu Leu Trp 

130 135 140 

30 Glu lie Val Ser Leu Gly Gly Thr Pro Tyr Cys Gly Met Thr Cys Ala 

145 150 155 160 



Glu Leu Tyr Glu Lys Leu Pro Gin Gly Tyr Arg Leu Glu Lys Pro Leu 

165 170 175 

Asn Cys Asp Asp Glu Val Tyr Asp Leu Met Arg Gin Cys Trp Arg Glu 

180 185 190 



Lys Pro Tyr Glu Arg Pro Ser Phe Ala Gin lie Leu Val Ser Leu Asn 
40 195 200 205 

Arg Met Leu Glu Glu Arg Lys Thr Tyr Val Asn Thr Thr Leu Tyr Glu 
210 215 220 

45 Lys Phe Thr Tyr Ala Gly lie Asp Cys Ser Ala Glu Glu Ala Ala 

225 230 235 
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What is claimed is: 

5 1 . An isolated DNA sequence encoding a biologically active human ork 

polypeptide, wherein the DNA sequence is selected from the group consisting of: 

(a) DNA comprising the coding region of the DNA sequence of SEQ ED 

NO:l; 

(b) DNA sequences that hybridize under moderately stringent conditions to 
10 the DNA of (a) and which encode a biologically active ork; and 

(c) DNA sequences that are degenerate as a result of the genetic code to a DNA 
sequence of (a) or (b), and which encode a biologically active ork. 

2 . An isolated DNA sequence according to claim 1 wherein the ork 
15 polypeptide is a soluble human ork polypeptide. 

3. An isolated DNA sequence according to claim 2, wherein said soluble 
human ork polypeptide comprises amino acids 19-745 of the amino acid sequence of 
SEQ ID NO:l. 

20 

4. An isolated DNA sequence comprising a nucleotide sequence selected from 
the group consisting of nucleotides 149-3520, 203-3520, 149-2383, and 203-2383 of 
SEQ ID NO:l. 

25 5. An expression vector comprising a DNA sequence according to claim 1. 

6. An expression vector comprising a DNA sequence according to claim 2. 

7 . An expression vector comprising a DNA sequence according to claim 3. 

30 

8. An expression vector comprising a DNA sequence according to claim 4. 

9. A process for preparing an ork polypeptide, cpmprising culturing a host cell 
transformed with an expression vector according to claim 5 under conditions promoting 

35 expression of ork, and recovering the ork polypeptide from the culture. 
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10. A process for preparing an ork polypeptide, comprising culturing a host cell 
transformed with an expression vector according to claim 6 under conditions promoting 
expression of ork and recovering the ork polypeptide from the culture. 

5 1 1 . A process for preparing an ork polypeptide, comprising culturing a host cell 

transformed with an expression vectoraccording to claim 7 under conditions promoting 
expression of ork, and recovering the ork polypeptide from the culture. 

12. A process for preparing an ork polypeptide, comprising culturing a host cell 
10 transformed with an expression vector according to claim 8 under conditions promoting 

expression of ork, and recovering the ork polypeptide from the culture. 

13. A substantially homogeneous purified biologically active ork protein 
encoded by a DNA sequence selected from the group consisting of: 

1 5 (a) DNA comprising the coding region of the DNA sequence of SEQ ID 

NO:l; 

(b) DNA sequences that hybridize under moderately stringent conditions to 
the DNA of (a) and which encode a biologically active ork; and 

(c) DNA sequences that are degenerate as a result of the genetic code to a 
20 DNA sequence of (a) or (b), and which encode a biologically active ork, 

14. A purified ork according to claim 13, wherein said ork is human ork. 

15. A purified ork according to claim 14, wherein said ork is a soluble human 

25 ork. 

16. A purified human ork protein comprising an amino acid sequence selected 
from the group consisting of amino acids 19-1 124 of SEQ ED NO:2 and amino acids 
19-745 of SEQ ID NO:2. 

30 

17. A purified ork protein comprising an extracellular domain that comprises 
(from N- to C-terminus) an immunoglobulin-Iike domain, three EGF-like cysteine 
repeats, a sequence containing a pair of cysteine residues, and three FNIII-like repeats, 
wherein the amino acid sequence of said extracellular domain is at least 80% identical to 

35 the sequence presented as amino acids 19-745 of SEQ ID NO:l. 

18. A purified ork protein according to claim 17, wherein said protein 
additionally contains a transmembrane region and a cytoplasmic domain. 
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19. An antibody immunoreactive with ork or an ork immunogen. 

20. An antisense or sense oligonucleotide that can inhibit transcription or 

5 translation of ork, comprising a sequence of at least about 14 nucleotides of the DNA 
sequence of SEQ CD NO: 1, or its DNA or RNA complement. 
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F I GURE 3A 
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NOVEL TYROSINE KINASE 

5 

BACKGROUND 

The ability of cells to respond to environmental cues is in large part due to the 
interaction of cell-surface receptors with external stimuli. A number of factors that 
interact with receptors have been identified, among which are receptor-binding proteins 

10 that are soluble, membrane-bound, or exist in both forms. 

One class of receptors, the receptor tyrosine kinases (RTKs), has been 
intensively studied and shown to be crucial to the growth and differentiation of a variety 
of cell types (Yarden and Ullrich, Ann. Rev. Biochem. 57:433-478, 1988). Tyrosine 
kinases are enzymes that catalyze the phosphorylation of tyrosine residues. Tyrosine 

15 phosphorylation is associated with signal-transduction across the cellular plasma 
membrane. The protein-tyrosine kinase family can be grouped into two very broad 
families: the above-mentioned RTKs, which are, or are intimately associated with, 
membrane-spanning growth factor receptors; and those that are associated with the 
membrane but lack a transmembrane sequence (Yarden and Ullrich, 1988, supra.). 

20 The RTKs can be further divided into five subgroups on the basis of structural 

similarities in their extracellular domains and the organization of the tyrosine kinase 
catalytic region in their cytoplasmic domains. Subgroups I (epidermal growth factor 
(EGF) receptor-like ), II (insulin receptor-like) and the ephleck family contain cysteine- 
rich sequences (Hirai et al, Science 238:1717-1720, 1987; Yanden and Ullrich, Ann. 

25 Rev. Biochem, 57,443-478, 1988; ! A id berg and Hunter, Mol Cell Biol 10:6316- 
6324, 1990). The functional domains of the kinase region of these three classes of 
RTKs are encoded as a contiguous sequence (Hanks et al, Science 241, 42-52, 
1988). Subgroups III (platelet- derived growth factor (PDGF) receptor- like) and IV 
(the fibroblast growth factor (FGF) receptors) are characterized as having 

30 immunoglobulin (Ig)-like folds in their extracellular domains, as well as having their 
kinase domains divided in two pans by a variable stretch of unrelated amino acids 
(Yanden and Ullrich, 1988; Hanks et al, 1988). 

While all members of the RTK family share a related cytoplasmic catalytic 
domain, the extracellular, ligand-binding domains of these receptors have adapted 

35 patchwork structure utilizing several structural motifs. The variability in the structure 
of the ligand-binding domains of the RTKs almost certainly reflects the diversity of the 
ligands for these receptors (Ullrich and Schlessinger, Cell 61, 243-254, 1990). These 
ligands range from relatively small, soluble peptides to cell surface proteins that 
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themselves resemble receptors. Examples of ligands bound by certain members of the 
RTK family are polypeptide growth factors and hormones. 

To deal with this diversity of ligands the RTKs have evolved extracellular 
domains that are a composite of several structural motifs. For example, the 
5 extracellular domain of the oxllark gene contains both Ig-domains and fibronectin type 
III (FNIII) repeats (O'bryan et al., MoL Cell Biol. 1 1:5016-5031, 1991; Rescigno et 
al., Oncogene 6:1909-1913, 1991), while members of the eph family have those two 
motifs separated by a (non-EGF-like) cysteine-rich domain (Hirai et al., Science 
238:1717-1720, 1987; Lindberg et al., MoL Cell. Biol. 10:6316-6324, 1990; Lhotak et 

10 al., MoL CelL Biol. 11:2496-2502, 1991; Chan and Watt, Oncogene 6:1057- 

1061,1991). This diversity strongly suggests that this family of receptors evolved by 
accumulating the structural motifs needed for ligand binding and combining these 
motifs with a conserved catalytic domain. 

In view of the role tyrosine kinases play in cell growth and differentiation, as 

15 well as signal transduction, isolation of novel tyrosine kinases enables one to study 
such biological processes. Identifying tyrosine kinases and their ligands also permits 
exploration of methods for inhibiting or enhancing the interaction thereof, depending on 
the desired biological effect. 

SUMMARY OF INVENTION 
20 The present invention provides a novel. receptor (a protein tyrosine kinase), 

isolated DNA encoding the tyrosine kinase, recombinant expression vectors containing 
the isolated DNA, and host cells transformed with the recombinant vector. Also 
provided is a method for producing the novel protein by cultivating the transformed 
host cells under conditions that promote expression of the tyrosine kinase, and 
25 recovering the expressed tyrosine kinase. 

BRIEF DESCRIPTION OF FIGURES 
Figure 1 presents a nucleotide sequence and deduced amino acid sequence of 
human ork cDNA. The nucleotide sequence is derived from two overlapping cDNA 
clones isolated from placental cDNA libraries. The initiating methionine codon and the 

30 transmembrane region are each underlined. The two cysteines of the immunoglobulin 
domain are boxed, as are the three EGF-like repeats. Brackets enclose the region 
amplified by PCR using primers based on conserved kinase domain sequences, as 
described in example 1. Numbering of nucleotides is in the left margin, while 
numbering of amino acids is on the right. The signal peptide comprises amino acids 1- 

35 18, with the threonine residue at position 19 being the first amino acid of the mature 
protein. The extracellular domain comprises amino acids 19-745, and the cytoplasmic 
domain comprises amino acids 773-1124. 
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Figure 2 presents an analysis of the EGF repeats in the ork sequence. The 
three EGF- repeats in the ork sequence were aligned with those in the tan-l and laminin 
Bl genes. The conserved cysteine residues are boxed and other conserved residues 
(conserved in two ork repeats and at least one other repeat) are marked with an asterisk. 
5 Figure 3 presents the results of studies described in example 2 below. The 

human ork cDNA encodes a 140 Kd phosphoprotein with in vitro tyrosine kinase 
activity. A. COS cells were transfected with the ork cDNA {ork), or a control plasmid 
without cDNA insert (mock), metabolically labelled with 35s-met/cys, and 
immunoprecipitated with the PI rabbit antiserum, raised against the carboxy-terminal 

10 21 amino acids of the predicted ork amino acid sequence (PI), or with pre-immune 

serum (C). The position of P \AQ or ^ is indicated, as are the positions of size standards. 
B. COS cells were transfected and immunoprecipitated as described above. The 
immunoprecipitates were incubated with 32p-ATP and separated by SDS-PAGE. The 
positions of P140 or * and size standards are indicated. C. The gel from Panel B was 

1 5 transferred to a membrane, exposed to X-ray film and the band corresponding to 

was excised. The filter slice was boiled in 6N HC1 and eluted material was 
separated by two-dimensional electrophoresis. The positions of standards for 
phosphoserine, phosphothreonine and phosphotyrosine, as well as the origin, are 
indicated. D. COS cells were transfected and immunoprecipitated as described above. 

20 The immunoprecipitates were separated by SDS-PAGE, transferred to a membrane and 
incubated with an anti-phosphotyrosine antibody. The positions of size standards are 
indicated. 

Figure 4 shows the results of a study of expression of ork mRNA in human 
tissues, as described in example 3 below. Two micrograms of poly(A)-containing 
25 RNA from the indicated tissues was. hybridized with an 32p_uTP-labelled RNA probe 
from the ork cDNA. The positions of size standards are marked. 

Figures 5a and 5b depict a comparison of the amino acid sequences of ork 
(top lines) and tie (bottom lines), the latter being a tyrosine kinase of the same subgroup 
as ork. The sequence comparison was generated by the GAP computer program 
30 described. Identical amino acids are indicated with a line; conservative or similar 

amino acid changes are indicated with two dots or one dot, respectively. The percent 
similarity and percent identity for the ork and tie sequences were found to be 64.5% 
and 47.5%, respectively. 

Figure 6 presents a comparison of the amino acid sequence encoded by a 
35 murine ork clone with the corresponding portion of a human ork protein. 
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DETAILED DESCRIPTION OF INVENTION 
The present invention provides a novel receptor protein tyrosine kinase, isolated 
DNA encoding the tyrosine kinase, recombinant expression vectors containing the 
5 isolated DNA, and host cells transformed with the recombinant vector. A method for 
producing the novel protein involves cultivating the transformed host cells under 
conditions that promote expression of the tyrosine kinase, and recovering the expressed 
tyrosine kinase from the cell culture. 

The novel protein is designated herein as ork (orphan receptor tyrosine kinase). 

10 An ork cDNA isolated from human placenta and transfected into COS-7 cells encodes a 
140Kd protein with the following combination of structural motifs in its extracellular 
domain: an immunoglobulin (Ig)-like domain followed by three epidermal growth 
factor (EGF)-like cysteine-rich repeats, which in turn are followed by three fibronectin 
type III (FNII1) repeats. Of the human tissues tested (example 3 below), the ork gene 

1 5 is expressed predominately in placenta and lung, with lower levels in umbilical vein 
endothelial cells, brain and kidney. 

Human ork is within the scope of the present invention, as are ork proteins 
derived from other mammalian species. As used herein, the term "ork" includes 
membrane-bound proteins (comprising a cytoplasmic domain, a transmembrane region, 

20 and an extracellular domain) as well as truncated proteins that retain the desired 
biological properties. Such truncated proteins include, for example, soluble ork 
comprising only the extracellular (ligand binding) domain. 

Using a polymerase chain reaction-based approach we have isolated orkcDNA 
and characterized this novel receptor protein tyrosine kinase. As described in example 

25 1 below, degenerate oligonucleotide probes based on certain sequences that are 

conserved within the kinase domain of RTKs were used as primers in a PCR reaction. 
Single-stranded cDNA derived from human placental poly(A)+ RNA was employed as 
the template. The PCR reaction products were inserted into a cloning vector and the 
nucleotide sequence of the cDNA inserts in a number of the resulting recombinant 

30 vectors was determined- A clone containing a cDNA insert of about 200 basepairs with 
a novel DNA sequence was identified. 

The cDNA insert was excised, radiolabeled, and used to probe an oligo-dT- 
primed human placental cDNA library. A hybridizing clone comprising a 2.3 Kb insert 
was identified, sequenced, and found to be a partial clone encoding a C-terminal ork 

35 fragment. The cDNA insert was excised, radiolabeled, and used to probe a random- 
primed human placental cDNA library. A clone comprising a cDNA insert about 4.0 
Kb in length that contained an entire coding region for the novel protein, as well as 5' 
and 3' untranslated sequences, was isolated. The 4.0 kb ork cDNA in plasmid 
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pBLUESCRIPT^SK in E. coli strain DH5a was deposited with the American Type 
Culture collection on May 28, 1992 and was given accession number ATCC 69003. 
The strain deposit was made under the terms of the Budapest Treaty. 

A DNA sequence for a human ork cDNA is presented in Figure 1, along with 
5 the amino acid sequence encoded thereby. Human ork DNA and amino acid sequences 
identical to those of Figure 1 are presented in SEQ ED NO: 1 and SEQ ID NO:2. 
Several features of the ork amino acid sequence are noteworthy. The extracellular 
domain is a patchwork of three structural motifs. Between amino acid residues 21 1 and 
340 are three copies of an EGF-like cysteine repeat (Davis, New Biologist 2:410-419, 

10 1991). The EGF-like repeats in the ork gene (boxed in Figure 1) differ from the 

consensus motif in that they have 8 cysteines each instead of 6. In this regard they are 
closely related to those in the tie gene (Partanen et aL, MoL Cell Biol., 12:1698-1707, 
1992). The proteins with the next most closely related cysteine repeats are laminin Bl, 
laminin B2 (Sasaki et al., J. Biol. Chem 263:16536-16544, 1988) and TAN, the 

15 human homolog of the Drosophila notch gene (Ellison et al, Cell 66:649-661). The 
three EGF- repeats in the ork sequence were aligned with those in the tanA and laminin 
Bl genes in Figure 2. In each case the cysteines in the repeat were aligned. 

Amino terminal to the EGF repeats are paired cysteines (boxed in Figure 1) 
indicative of Ig-like domains. This Ig domain is most similar to the C2 type as 

20 described by Williams and Barclay, Ann. Rev. Immunol 6:381-405, 1988. A second 
pair of cysteine residues follow the EGF repeats; however the additional amino acid 
residues required to generate a proper Ig-fold are missing. At the membrane proximal 
region of the extracellular domain, between amino acids 440 and 733, are three repeats 
of the fibronectin type III motif. Thus, the extracellular domain of ork is a structural 

25 mosaic consisting of repeats of at least three different functional motifs. 

Amino acid residues 746-772 are hydrophobic in nature and likely serve as a 
transmembrane domain. The cytoplasmic domain contains all the amino acid sequence 
hallmarks of a tyrosine kinase (Hanks et al., Science 241:42-52, 1988). A comparison 
of the ork amino acid sequence with known protein tyrosine kinases showed ork to be a 

30 member of the RTK family. 

Within the receptor tyrosine kinase family, the ork and above-mentioned tie 
genes form their own subgroup. The tie gene (described by Partanen et al., supra) has 
an extracellular domain that, like ork, comprises (from N- to C-terminus) an Ig-like 
domain followed by three EGF-like domains, an incomplete Ig-like domain, and three 

35 fibronectin type III repeats next to the transmembrane region. Additionally, a cDNA 
form lacking the first of the three epidermal growth factor homology domains was 
isolated by Partanen et aL, supra, suggesting that alternative splicing creates different 
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ne-type receptors. COS-7 cells transfected with a tie cDNA expression vector produced 
a glycosylated protein of 1 17 kDa. 

A comparison of the amino acid sequences of ork and tie is presented in Figures 
5a and 5b, which shows them to be distinct proteins. The comparison presented in 
5 Figures 5a and 5b was generated using the GAP computer program, which is described 
in detail below. The percent similarity and percent identity of the ork and tie amino acid 
sequences was found to be 64.5% and 47.5%, respectively. The RTK subgroup 
containing ork and tie is characterized by a unique arrangement of structural motifs in 
the proteins' extracellular domain, as described above. There are also several features 

10 in the cytoplasmic kinase domain worth noting. There is a short amino acid insert 
between the two parts of the kinase domains (Fig. 1). This insert is similar in size to 
that seen in the ret gene (Takahashi and Cooper, MoL Cell BioL 7:1378-1385, 1987). 
However, unlike the insert sequences in other RTK subgroups, the insert in the ork and 
tie genes contains no tyrosine residues. Several tyrosine residues are found in the 

15 cytoplasmic tail. As phosphorylation of tyrosine in either the kinase insert or the 
cytoplasmic tail is required for association of the RTK molecule with putative signal 
transduction molecules, this feature of the ork and tie genes mostly closely resembles 
the EGFR subfamily where phosphorylated tyrosine residues in the cytoplasmic tail 
have been shown to associate with other proteins such as phospholipase C-y (Margolis 

20 etal.,Ce// 57:1101-1107, 1989). 

One embodiment of the present invention provides an ork protein comprising an 
extracellular domain that comprises (from N- to C-terminus) an immunoglobin-like 
domain, three EGF-like cysteine-rich repeats, and three FNIII repeats, wherein the 
amino acid sequence of said extracellular domain is at least 80% identical to the 

25 sequence shown as amino acids 1 9-745 of SEQ ID NO: 1 . The extracellular domain 
amino acid sequence preferably is at least 90% identical to the extracellular domain 
sequence of SEQ ID NO: 1 . The percent identity between two amino acid sequences 
may be determined by using the GAP computer program available from the University 
of Wisconsin and described in detail below. Such ork proteins include those having a 

30 transmembrane region and a cytoplasmic domain (or a portion thereof) in addition to the 
extracellular domain. Such proteins also include those in which a sequence containing 
a pair of cysteines is positioned between the last EGF-like repeat and the first FN III 
repeat. Such a sequence (a stretch of amino acids resembling an incomplete Ig-Iike 
domain) is found in the native human ork protein described above. Since this sequence 

35 is not believed to function as an Ig-like domain, it is possible that ork proteins lacking 
some or all of this partial Ig-like domain (amino acids 341-439) will retain the ability to 
bind a ligand. The ork proteins when initially synthesized may comprise a signal 
peptide as well. 
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The expression pattern of the ork gene suggests that it is predominately 
expressed in endothelial cells. The structure of the extracellular domain, with the three 
EGF repeats, is especially intriguing in light of this. Several cell adhesion molecules, 
including endothelial-leukocyte adhesion molecule- 1 (ELAM-1) contain EGF repeats. 
5 ELAM-1 is expressed on the surface of activated endothelial cells and is involved in the 
attachment of neutrophils at sites of inflammation (Bevilacqua et al., Science 
243:1 150-1 165, 1989; Siegelman et aL, Cell 61:61 1-622, 1990). Several adhesion 
molecules also contain Ig and FNIII repeats as well. This suggests a possible role for 
the ork gene product in the communication between endothelial cells and leukocytes at 

10 sites of inflammation. cDNA encoding an ork polypeptide may be isolated from other 
mammalian species by procedures analogous to those employed in isolating the human 
ork clone. For example, a cDNA library derived from another mammalian species may 
be substituted for the human cDNA library that was screened using the degenerate 
probes in example 1 below. Alternatively (and preferably), the human ork cDNAs 

15 isolated in example 1 are labeled and used as probes to screen mammalian cDNA or 
genomic libraries using cross-species hybridization techniques. The probe may be 
derived from the coding region of the above-described 2.3kb or 4.0 kb human ork 
cDNAs. 

Murine ork cDNA was identified by cross-species hybridization. A murine 

20 cDNA library was screened using the 2.3 Kb human ork cDNA as a probe, as 
described in example 5. 

Cell types from which cDNA and genomic libraries may be prepared include 
those in which ork RNA expression was detected in example 3. rnRNAs isolated from 
various cell lines can be screened by Northern hybridization to determine additional 

25 suitable sources of mammalian ork mRNA for use in cloning an ork gene. Nucleic acid 
from mammalian sources that include but are not limited to murine, bovine, porcine, 
and primate, may be screened to identify ork genes. 

In addition to the membrane-bound full length protein depicted in SEQ ID 
NO:l, the present invention provides soluble forms of the ork protein. "Soluble ork" 

30 as used in the context of the present invention refers to polypeptides that contain all or 
pan of the extracellular region of a native ork and that, due to the absence of a 
transmembrane region that would cause retention of the polypeptide on a cell 
membrane, are secreted upon expression. Fragments of the extracellular domain may 
be employed as long as the fragment possesses the desired biological activity (e.g., 

35 binding to an mm-ork antibody or to the ligand for ork). Soluble ork may also include 
part of the transmembrane region or part of the cytoplasmic domain or other sequences, 
provided that the soluble ork protein is capable of being secreted. Preferred soluble ork 
polypeptides include the signal sequence and entire extracellular domain (amino acids 1 
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to 745 of SEQ ED NO: 1) or lack the signal sequence but contain the entire extracellular 
domain (amino acids 19 to 745 of SEQ ID NO: 1). 

Soluble ork polypeptides may be identified (and distinguished from their non- 
soluble membrane-bound counterparts) by separating intact cells which express the 
5 protein in question from the culture medium, e.g., by centrifugation, and assaying the 
medium (supernatant) for the presence of ork. The presence of ork in the medium 
indicates that the protein was secreted from the cells and thus is a soluble form. 
Soluble ork may be naturally-occurring forms of these proteins, such as those resulting 
from alternative splicing events. Alternatively, soluble fragments of ork proteins may 
10 be produced by recombinant DNA technology or otherwise isolated, as described 
below. 

The use of soluble forms of ork is advantageous for certain applications. 
Purification of the proteins from recombinant host cells is facilitated, since the soluble 
proteins are secreted from the cells. The smaller soluble fragments may be 

15 advantageous for use in certain in vitro assays. The soluble ork polypeptides may be 
employed to competitively bind the ligand in vivo, thus inhibiting signal transduction 
activity via endogenous cell surface bound ork proteins. Further, soluble proteins are 
generally more suitable for intravenous administration and may exert their desired effect 
(e.g., binding a ligand) in the bloodstream. 

20 Truncated ork proteins, including soluble polypeptides, may be prepared by any 

of a number of conventional techniques. In the case of recombinant proteins, a DNA 
fragment encoding a desired ork fragment may be subcloned into an expression vector. 
A desired DNA sequence may be chemially synthesized using known techniques. 
DNA fragments also may be produced by restriction endonuclease digestion of a full 

25 length cloned DNA sequence, and isolated by electrophoresis on agarose gels. Linkers 
containing restriction endonuclease cleavage site(s) may be employed to insert the 
desired DNA fragment into an expression vector, or the fragment may be digested at 
cleavage sites naturally present therein. 

In another approach, enzymatic treatment (e.g., using Bal 31 exonuclease) may 

30 be employed to delete terminal nucleotides from a DNA fragment to obtain a fragment 
having a particular desired terminus. Among the commercially available linkers are 
those that can be ligated to the blunt ends produced by Bal 31 digestion, and which 
contain restriction endonuclease cleavage site(s). Alternatively, oligonucleotides that 
reconstruct the N- or C-terminus of a DNA fragment to a desired point may be 

35 synthesized. The oligonucleotide may contain a restriction endonuclease cleavage site 
upstream of the desired coding sequence and position an initiation codon (ATG) at the 
N-terminus of the coding sequence. The well known polymerase chain reaction 
procedure also may be employed to amplify a DNA sequence encoding a desired 
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protein fragment. 3' and 5' oligonucleotide primers that anneal to the ork DNA at the 
termini of a desired fragment are employed in the PCR reaction which is conducted 
using any suitable procedure, such as those described in Sarki et al., Science 239:487 
(1988); in Recombinant DNA Methodology, Wu et al., eds., Academic Press Inc., San 
5 Diego (1989), pp. 189-196; and in PCR Protocols: A Guide to Methods and 

Applications, Innis et al., eds., Academic Press, Inc. (1990). An example of a suitable 
PCR procedure is as follows. All temperatures are in degrees centigrade. The 
following PCR reagents are added to a 0.5 ml Eppendorf microfuge tube: 10 jil of 10X 
PCR buffer (500 mM KC1, 100 mM Tris-HCl, pH 8.3 at 25°C, 25 mM MgCl 2 , and 1 

10 mg/ml gelatin) (Perkin-Elmer Cetus, Norwalk, CN), 8jil of a 2.5 mM solution 

containing each dNTP (2 mM dATP, 2 mM dCTP, 2 mM dGTP and 2 mM dTTP), 2.5 
units (0.5 pi of standard 5000 units/ml solution) of Taq DNA polymerase (Perkins- 
Elmer Cetus), 1 ng of template DNA, 100 picomoles of each of the oligonucleotide 
primers, and water to a final volume of 100 jxl. The final mixture is then overlaid with 

15 100 \i\ parafin oil. PCR is carried out using a DNA thermal cycler (Ericomp, San 

Diego, CA). The template is denatured at 94° for 5 minutes and PCR is carried out for 
25 cycles of amplification using a step program (denaturation at 94°, 1.5 minutes; 
annealing at 60°, 1 minute; extension at 72°, 1 minute). 

The present invention also provides full length ork protein or antigenic 

20 fragments thereof that can act as immunogens to generate antibodies specific to the ork 
immunogens. Monoclonal antibodies specific for ork or antigenic fragments thereof are 
prepared by prodedures that include those described in Example 4. The above- 
described procedures for producing ork fragments may be employed in producing ork 
fragments for use as immunogens. 

25 

Expression of Recombinant ork Proteins 

The present invention provides recombinant expression vectors to express DNA 
encoding the ork proteins of the present invention. The inventive recombinant 
expression vectors are replicable DNA constructs which contain a synthetic or cDNA- 

30 derived DNA sequence encoding an ork protein, operably linked to suitable 

transcriptional or translational regulatory elements. Examples of genetic elements 
having a regulatory role in gene expression include transcriptional promoters, operators 
or enhancers, a sequence encoding suitable mRNA ribosomal binding sites, and 
appropriate transcription and translation initiation and termination sequences. The 

35 ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants may additionally be 
incorporated. The regulatory elements employed in the expression vectors are generally 
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derived from mammalian, microbial, viral, or insect genes. Expression vectors derived 
from retroviruses also may be employed. 

DNA regions are operably linked when they are functionally related to each 
other. A DNA sequence encoding ork is said to be operably linked to one or more of 
5 the above-described regulatory elements when the ork DNA sequence is transcribed, or 
the resulting mRNA is translated, under the control of the regulatory element(s). 

Transformed host cells are cells which have been transformed or transfected 
with foreign DNA using recombinant DNA techniques. In the context of the present 
invention, the foreign DNA includes a sequence encoding the inventive ork protein. 
10 Host cells may be transformed for purposes of cloning or amplifying the foreign DNA, 
or may be transformed with an expression vector for production of the fusion protein 
under the control of appropriate promoters. Suitable host cells include prokaryotes, 
yeast, or higher eukaryotic cells. Appropriate cloning and expression vectors for use 
with bacterial, fungal, yeast, and mammalian cellular hosts are described by Pouwels et 
15 al. {Cloning Vectors; A Laboratory Manual, Elsevier, New York, 1985), the relevant 
disclosures of which is hereby incorporated by reference. Cell-free translation systems 
could also be employed to produce fusion protein using RNAs derived from the DNA 
constructs of the present invention. 

Prokaryotes include gram negative or gram positive organisms. Prokaryotic 
20 expression vectors generally comprise one or more phenotypic selectable markers, for 
example a gene encoding proteins conferring antibiotic resistance or supplying an 
autotrophic requirement, and an origin of replication recognized by the host to ensure 
amplification within the host. Examples of suitable prokaryotic hosts for 
transformation include E. coli, bacilli such as Bacillus subtilis, Salmonella 
25 typhimurium, and various species within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a matter of choice. 

Useful expression vectors for bacterial use can comprise a selectable marker and 
bacterial origin of replication derived from commercially available plasmids comprising 
genetic elements of the well-known cloning vector pBR322 (ATCC 37017). Such 
30 commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 
Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, WI, USA). These 
pBR322 "backbone' 1 sections are combined with an appropriate promoter and the 
structural sequence to be expressed. E, coli is typically transformed using derivatives 
of pBR322, a plasmid derived from an £. coli species (Bolivar et al., Gene 2:95, 
35 1977). pBR322 contains genes for ampicillin and tetracycline resistance, providing 
simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include 
the S-lactamase (penicillinase) and lactose promoter system (Chang et al., Nature 
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275:615, 1978; and Goeddel et al., Nature 257:544, 1979), the tryptophan (trp) 
promoter system (Goeddel et al., Nucl. Acids Res. 5:4057, 1980; and EPA 36,776) 
and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system 
employs the phage X Pl promoter and cI857ts thermoinducible repressor. Plasmid 
vectors available from the American Type Culture Collection which incorporate 
derivatives of the X Pl promoter include plasmid pHUB2, resident in E. coli strain 
JMB9 (ATCC 37092) and pPLc28, resident in E. coli RR1 (ATCC 53082). 

The recombinant ork protein may also be expressed in yeast hosts, preferably 
from Saccharomyces species, such as S. cerevisiae. Yeast of other genera such as 
Pichia or Kluyveromyces may also be employed. Yeast vectors will generally contain 
an origin of replication from the 2(im yeast plasmid or an autonomously replicating 
sequence (ARS), a promoter, DNA encoding the ork protein, sequences for 
polyadenylation and transcription termination and a selection gene. Yeast vectors may 
include origins of replication and selectable markers permitting transformation of both 
yeast and E. coli, e.g., the ampicillin resistance gene of E. coli and the S. cerevisiae 
trpl gene. The trp 1 gene provides a selection marker for a mutant strain of yeast 
lacking the ability to grow in tryptophan, and a promoter derived from a highly 
expressed yeast gene to induce transcription of a structural sequence downstream. The 
presence of the trpl lesion in the yeast host cell genome then provides an effective 
environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 
metallothionein, 3-phosphoglycerate kinase (Hitzeman et ah, J. BioL Chem. 255:2073, 
1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 1968; and 
Holland et al., Biochem. 77:4900, 1978), such as enolase, glyceraldehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucoses- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase and glucokinase. Examples of suitable vectors 
and promoters for use in yeast expression are further described in R. Hitzeman et al, 
EPA 73,657. 

Preferred yeast vectors can be assembled using DNA sequences from pBR322 
for selection and replication in E. coli (Amp r gene and origin of replication) and yeast 
DNA sequences including a glucose-repressible ADH2 promoter and a-factor 
secretion leader. The ADH2 promoter has been described by Russell et al. (J. BioL 
Chem. 253:2674, 1982) and Beier et al., {Nature 300:124, 1982). Advantageously, a 
DNA segment encoding a leader sequence functional in yeast is operably linked to the 
5' end of the DNA encoding the ork protein. The encoded leader peptide promotes 
secretion of the ork protein from the host cell and is generally cleaved from the ork 
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protein upon secretion. As one example, the yeast a-factor leader, which directs 
secretion of heterologous proteins, can be inserted between the promoter and the 
structural gene to be expressed. See, e.g., Kurjan et al., Cell 30:922, 1982; and Bitter 
et al., Proc. Natl. Acad. Sci. USA 87:5330, 1984. The leader sequence may be 
5 modified to contain, near its 3' end, one or more useful restriction sites to facilitate 
fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to those of skill in the art. 
An exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 
75:1929, (1978), selecting for Trp+ transformants in a selective medium consisting of 
10 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, lOjig/ml adenine and 
20 |ig/ml uracil. Host strains transformed by vectors comprising the above-described 
ADH2 promoter may be grown for expression in a rich medium consisting of 1% yeast 
extract, 2% peptone, and 1 % glucose supplemented with 80 fag/ml adenine and 80 
jig/ml uracil. Derepression of the ADH2 promoter occurs upon exhaustion of medium 
15 glucose. Crude yeast supernatants are harvested by filtration and held at 4°C prior to 
further purification. 

Various mammalian or insect cell culture systems can be employed to express 
recombinant protein. Baculovirus systems for production of heterologous proteins in 
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988). 
20 Established cell lines of mammalian origin may be employed. Examples of suitable 

mammalian host cell lines include the COS-7 line of monkey kidney cells (ATCC CRL 
1651), described by Gluzman (Cell 23:175, 1981), CV-1 cells (ATCC CCL 70) also 
derived from monkey kidney, L cells, C127, 3T3, Chinese hamster ovary (CHO), 
HeLa and BHK cell lines. Mammalian expression vectors may comprise non- 
25 transcribed elements such as an origin of replication, a suitable promoter and enhancer 
linked to the gene to be expressed, and other 5' or 3' flanking nontranscribed 
sequences, and 5' or 3' nontranslated sequences, such as necessary ribosome binding 
sites, a poly-adenylation site, splice donor and acceptor sites, and transcriptional 
termination sequences. 

30 The transcriptional and translational control sequences in expression vectors to 

be used in transforming vertebrate cells may be provided by viral sources. For 
example, commonly used promoters and enhancers are derived from Polyoma, 
Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences 
derived from the SV40 viral genome, for example, SV40 origin, early and late 

35 promoter, enhancer, splice, and polyadenylation sites may be used to provide the other 
genetic elements required for expression of a heterologous DNA sequence. The early 
and late promoters are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the S V40 viral origin or replication (Tiers et al., 
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Nature 273: 1 1 3, 1978). Smaller or larger SV40 fragments may also be used, provided 
the approximately 250 bp sequence extending from the Hind III site toward the BgR 
site located in the viral origin of replication is included. Exemplary vectors can be 
constructed as disclosed by Okayama and Berg (MoL Cell Biol. J.280, 1983). A 
5 useful system for stable high level expression of mammalian receptor cDNAs in CI 27 
murine mammary epithelial cells can be constructed substantially as described by 
Cosman et al. (MoL Immunol. 23:935, 1986). Other expression vectors for use in 
mammalian host cells are derived from retroviruses. 



10 Producing and Purifying the ork Protein 

The present invention provides substantially homogeneous ork protein, which 
may be produced by recombinant expression systems as described above or purified 
from naturally occurring cells. The ork protein is purified to substantial homogeneity, 
as indicated by a single protein band upon analysis by SDS-polyacrylamide gel 

15 electrophoresis (SDS-PAGE). 

In one embodiment of the present invention, ork is purified from a cellular 
source using any suitable protein purification technique. The tissues identified in 
example 3 as containing ork RNA (preferably placental or lung tissue) from a 
mammalian species of interest may be employed as sources of ork, for example. 

20 An alternative process for producing the recombinant ork protein of the present 

invention comprises culturing a host cell transformed with an expression vector 
comprising a DNA sequence that encodes said ork protein under conditions that 
promote expression of the ork protein, which is then purified from culture media or cell 
extracts. Any suitable purification process may be employed, with the procedure of 

25 choice varying according to such factors as the type of host cells and whether or not the 
desired protein is secreted from the host cells. The fusion protein will be secreted into 
the culture medium when it is initially fused to a signal sequence or leader peptide 
operative in the host cells, or when the protein comprises soluble forms of the ork 
polypeptides. 

30 For example, supernatants from expression systems which secrete recombinant 

protein into the culture medium can be first concentrated using a commercially available 
protein concentration filter, e.g., an Amicon or Millipore Pellicon ultrafiltration unit. 
Following the concentration step, the concentrate can be applied to a suitable 
purification matrix. For example, an immunoaffinity column comprising antibodies 

35 directed against ork and bound to a suitable support may be employed. A monoclonal 
antibody specific for ork may be prepared as described in example 4. Alternatively, an 
anion exchange resin can be employed, for example, a matrix or substrate having 
pendant diethylaminoethyl (DEAE) groups. The matrices can be acrylamide, agarose, 
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dextran, cellulose or other types commonly employed in protein purification. 
Alternatively, a cation exchange step can be employed. Suitable cation exchangers 
include various insoluble matrices comprising sulfopropyl or carboxymethyl groups. 
Sulfopropyl groups are preferred. One or more reversed-phase high performance 
5 liquid chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media, 
e.g., silica gel having pendant methyl or other aliphatic groups, can be employed to 
further purify ork. 

Recombinant protein produced in bacterial culture is usually isolated by initial 
extraction from cell pellets, followed by one or more concentration, salting-out, 

10 aqueous ion exchange or size exclusion chromatography steps. Finally, high 

performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of recombinant fusion proteins can 
disrupted by any convenient method, including freeze-thaw cycling, sonication, 
mechanical disruption, or use of cell lysing agents. 

15 Fermentation of yeast which express ork as a secreted protein greatly simplifies 

purification. Secreted recombinant protein resulting from a large-scale fermentation can 
be purified by methods analogous to those disclosed by Urdal et al. (J. Chromatog. 
296:\ll, 1984), involving two sequential, reversed-phase HPLC steps for purification 
of a recombinant protein on a preparative HPLC column. 

20 Some or all of the foregoing purification steps, in various combinations, can be 

employed to provide an essentially homogeneous recombinant protein. Recombinant 
cell culture enables the production of the ork protein free of those contaminating 
proteins which may be normally associated with ork as it is found in nature, e.g., in 
cells, cell exudates or body fluids. The foregoing purification procedures are among 

25 those that may be employed to purify non-recombinant ork proteins of the present 
invention as well. 

Variants and Derivatives of ork 

Variants and derivatives of native ork proteins that retain the desired biological 
30 activity are also within the scope of the present invention. An ork variant, as referred 
to herein, is a polypeptide substantially homologous to a native ork, but which has an 
amino acid sequence different from that of native ork (human, murine or other 
mammalian species) because of one or a plurality of deletions, insertions or 
substitutions. 

35 The variant amino acid sequence preferably is at least 80% identical to a native 

ork amino acid sequence, most preferably at least 90% identical. When the variant ork 
protein comprises extracellular, transmembrane and cytoplasmic domains, these percent 
identities apply to the entire sequence and also to the extracellular domain when taken 
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alone. Since the cytoplasmic domains are relatively conserved among members of the 
RTK family, the percent identity of the extracellular domain is important in identifying a 
variant as an ork protein. 

To illustrate this point, a comparison of the full length ork and tie amino acid 
5 sequences (aligned in Figure 5) reveals 76% identity for the cytoplasmic domains, 
whereas the percent identity drops to 47.5% for the full length sequences as a whole. 
The percent identity drops to 33% when just the extracellular domains are compared. A 
DNA probe corresponding to the extracellular region of tie would not hybridize to ork 
DNA under moderately stringent hybridization conditions. 
10 The degree of homology (percent identity) may be determined, for example, by 

comparing sequence information using the GAP computer program, version 6.0 
described by Devereux et al. (NucL Acids Res, 12:387, 1984) and available from the 
University of Wisconsin Genetics Computer Group (UWGCG). The GAP program 
utilizes the alignment method of Needleman and Wunsch (/. MoL BioL 48:443, 
15 1970), as revised by Smith and Waterman (Adv. Appl. Math 2:482, 1981). Briefly, 
the GAP program defines similarity as the number of aligned symbols (i.e., nucleotides 
or amino acids) which are similar, divided by the total number of symbols in the shorter 
of the two sequences. The preferred default parameters for the GAP program include: 
(1) a unary comparison matrix (containing a value of 1 for identities and 0 for non- 
20 identities) for nucleotides, and the weighted comparison matrix of Gribskov and 

Burgess, NucL Acids Res. 14:6745, 1986, as described by Schwartz and Dayhoff, 
eds., Atlas of Protein Sequence and Structure, National Biomedical Research 
Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for each gap and an additional 
0.10 penalty for each symbol in each gap; and (3) no penalty for end gaps. 
25 Alterations of the native amino acid sequence may be accomplished by any of a 

number of known techniques. Mutations can be introduced at particular loci by 
synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites 
enabling ligation to fragments of the native sequence. Following ligation, the resulting 
reconstructed sequence encodes an analog having the desired amino acid insertion, 
30 substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can 
be employed to provide an altered gene having particular codons altered according to 
the substitution, deletion, or insertion required. Exemplary methods of making the 
alterations set forth above are disclosed by Walder et al. (Gene 42:133, 1986); Bauer et 
35 al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985, 12-19); Smith et al. 
(Genetic Engineering: Principles and Methods, Plenum Press, 1981); and U.S. Patent 
Nos. 4,518,584 and 4,737,462, which are incorporated by reference herein. 
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Variants may comprise conservatively substituted sequences, meaning that a 
given amino acid residue is replaced by a residue having similar physiochemical 
characteristics. Examples of conservative substitutions include substitution of one 
aliphatic residue for another, such as He, Val, Leu, or Ala for one another, or 
5 substitutions of one polar residue for another, such as between Lys and Arg; Glu and 
Asp; or Gin and Asn. Other such conservative substitutions, for example, substitutions 
of entire regions having similar hydrophobicity characteristics, are well known. 

ork also may be modified to create ork derivatives by forming covalent or 
aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids, 

10 phosphate, acetyl groups and the like. Covalent derivatives of ork may be prepared by 
linking the chemical moieties to functional groups on ork amino acid side chains or at 
the N-terminus or C-terminus of an ork polypeptide or the extracellular domain 
thereof. Other derivatives of ork within the scope of this invention include covalent or 
aggregative conjugates of ork or its fragments with other proteins or polypeptides, 

15 such as by synthesis in recombinant culture as N-terminal or C-terminal fusions. For 
example, the conjugate may comprise a heterologous signal or leader polypeptide 
sequence at the N-terminus of an ork polypeptide. Examples of such signal peptides 
are the a-factor leader of Saccharomyces\ the signal sequence for interleukin-7 (IL-7) 
described in United States Patent 4,965,195; or the signal sequence for interleukin-2 

20 receptor described in United States Patent Application 06/626,667 filed on July 2, 
1984, The signal or leader peptide co-translationally or post-translationally directs 
transfer of the conjugate from its site of synthesis to a site outside of the cell membrane 
or cell wall. 

ork polypeptide fusions can comprise peptides added to facilitate purification 
25 and identification of ork . Such peptides include, for example, poly-His or the 

antigenic identification peptides described in U.S. Patent No. 5,011,912 and in Hopp 
et al., Bio/Technology 6:1204, 1988. One such peptide is the FLAG® peptide, Asp- 
Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK), which is highly antigenic and 
provides an epitope reversibly bound by a specific monoclonal antibody enabling rapid 
30 assay and facile purification of expressed recombinant protein. This sequence is also 
specifically cleaved by bovine mucosal enterokinase at the residue immediately 
following the Asp-Lys pairing. Fusion proteins capped with this peptide may also be 
resistant to intracellular degradation in £. coll A murine hybridoma designated 4E1 1 
produces a monoclonal antibody that binds the peptide DYKDDDDK in the presence of 
35 certain divalent metal cations (as described in U.S. Patent 5,01 1 ,912) and has been 
deposited with the American Type Culture Collection under accession no HB 9259. 

The present invention further includes ork polypeptides with or without 
associated native-pattern glycosylation. ork expressed in yeast or mammalian 
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expression systems (e.g., COS-7 cells) may be similar to or significantly different 
from a native ork polypeptide in molecular weight and glycosylation pattern, 
depending upon the choice of expression system. Expression of ork polypeptides in 
bacterial expression systems, such as E. coli, provides non-glycosylated molecules. 
5 DNA constructs that encode various additions or substitutions of amino acid 

residues or sequences, or deletions of terminal or internal residues or sequences not 
needed for biological activity or binding, can be prepared. For example, N- 
glycosylation sites in the ork extracellular domain can be modified to preclude 
glycosylation while allowing expression of a homogeneous, reduced carbohydrate 

10 analog using yeast expression systems. N-glycosylation sites in eukaryotic 

polypeptides are characterized by an amino acid triplet Asn-X-Y, wherein X is any 
amino acid except Pro and Y is Ser or Thr. Appropriate modifications to the nucleotide 
sequence encoding this triplet will result in substitutions, additions or deletions that 
prevent attachment of carbohydrate residues at the Asn side chain. Known procedures 

15 for inactivating N-glycosylation sites in proteins include those described in U.S. Patent 
5,071,972 and EP 276,846. In another example, sequences encoding Cys residues 
that are not essential for biological activity can be altered to cause the Cys residues to be 
deleted or replaced with other amino acids, preventing formation of incorrect 
intramolecular disulfide bridges upon renaturation. Other variants are prepared by 

20 modification of adjacent dibasic amino acid residues to enhance expression in yeast 

systems in which KEX2 protease activity is present. EP 212,914 discloses the use of 
site-specific mutagenesis to inactivate KEX2 protease processing sites in a protein. 

Naturally occurring ork variants are also encompassed by the present 
invention. Examples of such variants are proteins that result from alternative mRNA 

25 splicing events (since ork is encoded by a multi-exon gene) or from proteolytic 

cleavage of the ork protein, provided the desired biological activity (e.g., binding to an 
anti-ork antibody or to the ligand) is retained. Alternative splicing of mRNA may yield 
a truncated but biologically active ork protein, such as a naturally occurring soluble 
form of the protein, for example. An alternative splicing event in tie, an RTK of the 

30 same subgroup as ork, yielded a protein lacking the first of three fibronectin type III 
repeats. Variations attributable to proteolysis include, for example, differences in the 
N- or C-termini upon expression in different types of host cells, due to proteolytic 
removal of one or more terminal amino acids (which may occur intracellular^ or during 
purification). Varying N-termini may also result from cleavage of the signal peptide in 

35 certain host cells at a point other than between amino acids 18 and 19 of the disclosed 
sequence. 

In certain host cells, post-translational processing will remove the methionine 
residue encoded by an initiation codon, whereas the methionine residue will remain at 
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the N-terminus of proteins produced in other host cells. The N-terminal amino acid 
may, for example, be any of the amino acids at positions 1 to 5 of SEQ ID NO: 1 (for 
proteins comprising a signal peptide) or 19-23 (for the mature protein). The C- 
terminus may be truncated deliberately during expression vector construction (e.g., in 
5 constructing vectors encoding soluble proteins as described above) or as a result of 
differential processing which may remove up to about five C-terminal amino acids, for 
example. 

Due to the known degeneracy of the genetic code wherein more than one codon 
can encode the same amino acid, a DNA sequence may vary from that presented in SEQ 
10 ID NO:l, and still encode an ork protein having the amino acid sequence of SEQ ID 
NO:l. Such variant DNA sequences may result from silent mutations (e.g., occurring 
during PCR amplification), and may be the product of deliberate mutagenesis of a 
native sequence. 

Nucleic acid sequences within the scope of the present invention include 

15 isolated DNA and RNA sequences that hybridize to the ork nucleotide sequences 

disclosed herein under conditions of moderate or severe stringency, and which encode 
biologically active ork. Moderate stringency hybridization conditions refer to 
conditions described in, for example, Sambrook et al. Molecular Cloning: A 
Laboratory Manual, 2 ed. Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory 

20 Press, (1989). Conditions of moderate stringency, as defined by Sambrook et al., 

include use of a prewashing solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0) 
and hybridization conditions of about 55*C, 5 X SSC, overnight. Conditions of severe 
stringency include higher temperatures of hybridization and washing. The skilled 
artisan will recognize that the temperature and wash solution salt concentration may be 

25 adjusted as necessary according to factors such as the length of the probe. 

The present invention thus provides isolated DNA sequences encoding 
biologically active ork , selected from: (a) DNA derived from the coding region of a 
native mammalian ork gene (e.g., cDNA derived from the coding region of the human 
ork cDNA sequence presented in SEQ ED NO: 1 ); (b) DNA capable of hybridizing 

30 under moderately stringent conditions to a DNA derived from the extracellular region of 
the sequence presented in SEQ ID NO: 1, and which encodes biologically active ork ; 
and (c) DNA which is degenerate as a result of the genetic code to a DNA defined in 
(a) or (b) and which encodes biologically active ork. 

ork polypeptides in the form of oligomers such as dimers or trimers are within 

35 the scope of the present invention. Oligomers may be linked by disulfide bonds formed 
between cysteine residues on different ork polypeptides. In one embodiment of the 
invention, an ork dimer is created by fusing ork to the Fc region of an antibody (IgGl). 
The Fc polypeptide preferably is fused to the C-terminus of a soluble ork (comprising 
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only the extracellular domain). A gene fusion encoding the ork fusion protein is 
inserted into an appropriate expression vector. The ork fusion proteins are allowed to 
assenble much like antibody molecules, whereupon interchain disulfide bonds form 
between Fc polypeptides, yielding divalent orL If fusion proteins are made with both 
5 heavy and light chains of an antibody, it is possible to form an ork oligomer with as 
many as four ork extracellular regions. Alternatively, one can link two soluble ork 
domains with a peptide linker such as the Gly4SerGly5Ser linker sequence described in 
United States Patent 5,073,627. A fusion protein comprising two or more ork 
polypeptides (with or without peptide spacers) may be produced by recombinant DNA 
10 technology. 

The present invention further provides antisense or sense oligonucleotides 
comprising a single-stranded nucleic acid sequence (either RNA or DNA) capable of 
binding to target ork mRNA (sense) or ork DNA (antisense) sequences. Antisense or 
sense oligonucleotides, according to the present invention, comprise a fragment of the 

15 coding region of ork cDNA. Such a fragment generally comprises at least about 14 
nucleotides, preferably from about 14 to about 30 nucleotides. The ability to create an 
antisense or a sense oligonucleotide, based upon a cDNA sequence for a given protein 
is described in, for example, Stein and Cohen, Cancer Res. 48:2659, 1988 and van der 
Krol et al., BioTechniques 6:958, 1988. 

20 Binding of antisense or sense oligonucleotides to target nucleic acid sequences 

results in the formation of duplexes that block translation (RNA) or transcription 
(DNA) by one of several means, including enhanced degradation of the duplexes, 
premature termination of transcription or translation, or by other means. The antisense 
oligonucleotides thus may be used to block expression of ork proteins. Antisense or 

25 sense oligonucleotides further comprise oligonucleotides having modified sugar- 
phosphodiester backbones (or other sugar linkages, such as those described in 
WO91/06629) and wherein such sugar linkages are resistant to endogenous nucleases. 
Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of 
resisting enzymatic degradation) but retain sequence specificity to be able to bind to 

30 target nucleotide sequences. Other examples of sense or antisense oligonucleotides 

include those oligonucleotides which are covalently linked to organic moieties, such as 
those described in WO 90/10448, and other moieties that increase affinity of the 
oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further 
still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes 

35 may be attached to sense or antisense oligonucleotides to modify binding specificities of 
the antisense or sense oliginucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the 
target nucleic acid sequence by any gene transfer method, including, for example, 
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CaP04-mediated DNA transfection, electroporarion, or other gene transfer vectors such 
as Epstein-Barr virus. Antisense or sense oligonucleotides are preferably introduced 
into a cell containing the target nucleic acid sequence by insertion of the antisense or 
sense oligonucleotide into a suitable retroviral vector, then contacting the cell with the 
5 retrovirus vector containing the inserted sequence, either in vivo or ex vivo. Suitable 
retroviral vectors include, but are not limited to, the murine retrovirus M-MuLV, N2 (a 
retrovirus derived from M-MuLV), or or the double copy vectors designated DCT5A, 
DCT5B and DCT5C (see PCT Application US 90/02656). Alternatively, other 
promotor sequences may be used to express the oligonucleotide. 

10 Sense or antisense oligonucleotides may also be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand 
binding molecule, as described in WO 91/04753. Suitable ligand binding molecules 
include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. 

1 5 Alternatively, a sense or an antisense oligonucleotide may be introduced into a 

cell containing the target nucleic acid sequence by formation of an oligonucleotide-lipid 
complex, as described in WO 90/10448. The sense or antisense oligonucleotide-lipid 
complex is preferably dissociated within the cell by an endogenous lipase. 
Uses of Ork Proteins 

20 One use of ork is as a research tool for identifying the ligand that binds thereto 

and studying the biological effects of ligand binding. The ork polypeptides of the 
present invention also may be employed in in vitro assays for detection of ork or its 
ligand or the interactions thereof. 

The ork polypeptides of the present invention can be used in a binding assay to 

25 detect cells expressing a ligand for ork. For example, ork or an extracellular domain or 
a fragment thereof can be conjugated to a detectable moiety such as ^5\ t 
Radiolabeling with ^5j can b e performed by any of several standard methodologies 
that yield a functional 125j or ^ molecule labeled to high specific activity. Alternatively, 
another detectable moiety such as an enzyme that can catalyze a colorometric or 

30 fluorometric reaction, biotin or avidin may be used. Cells to be tested for ork ligand 
expression can be contacted with the labeled ork. After incubation, unbound labeled 
ork is removed and binding is measured using the detectable moiety. 

Soluble ork polypeptides may be employed to competitively bind the ligand in 
vivo, thus inhibiting signal transduction activity via endogenous cell surface bound ork 

35 proteins. Further, soluble proteins are generally more suitable for intravenous 
administration and may exert their desired effect (e.g. binding a ligand) in the 
bloodstream. Soluble ork proteins comprising only the extracellular (ligand-binding) 
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domain lack the tyrosine kinase domain which is located within the cytoplasmic 
domain. 

The ork polypeptides disclosed herein are also useful in generating antibodies 
specific to the ork immunogens. Monoclonal antibodies specific for ork or antigenic 
5 fragments thereof may be prepared by procedures that include those described in 

Example 4. The monoclonal antibodies can be attached to insoluble support materials 
for use in immunoaffinity column purification of ork proteins. Other uses for the 
antibodies include detecting ork proteins in in vitro assays and identifying and purifying 
additional ork polypeptides such as variants (e.g., from alternative splicing events) that 
10 comprise the region from which the immunogen was derived. 

EXAMPLE 1: PCR-Based Cloning of Human orkcDNA 
Degenerate oligonucleotide primers were synthesized corresponding to the 
conserved sequences HRDLAA (TK-1; sense orientation) and SDVWS (TK-2; 
15 antisense orientation) contained within the kinase domain of all RTKs (Hanks, et al., 
1988). The oligonucleotides corresponding to the conserved motifs HRDLAA and 
SDVWS were 

(S'-CACC/aAGG/aGAC/t^tTGGCA/tGC-S') and 

(5 , -AG/ A GACCA A ^ r /c/GACG/ A TC G /ACT-3 t ), respectively. Both oligonucleotides 

20 were 32-fold degenerate and contained recognition sequences for Xho I. The 

HRDLAA sequence was chosen to favor RTKs over src-family tyrosine kinases that 
have the sequence HRDLRA (Hanks et al., 1988). 

Single-stranded cDNA was synthesized from human placental polyadenylated 
RNA by standard methods. The single stranded cDNA was used as template for a PCR 

25 reaction using the conditions of Wilks (1989). The two degenerate oligonucleotides 
were used as primers in the PCR reaction. Amplified PCR reaction products of about 
200 base pairs were digested with Xho I and ligated into an Xho I-digested plasmid 
vector desinated pBLUESCRIPT® SK. This vector, available from Stratagene 
Cloning Systems, La Jolla, California, is replicable in £. coli and contains apolylinker 

30 segment that includes 21 unique restriction sites, one of which is Xho I. The ligation 
mixture was transformed into E. coli cells by conventional procedures. 

Recombinant plasmids were recovered from the transformed E. coli cells and 
the nucleotide sequence of the DNA insert in a number of individual clones was 
determined. Among the known RTKs that were represented in the PCR-library were c- 

35 fms, JTK-4 (Partanen et aL Proc. NatL Acad. ScL USA, 87:8913-17, 1990), kdr 

(Terman et al., Oncogene, 6:1677-1683, 1991), eph (Hirai et al, 1987 supra), and fit- 
4 (Aprelikova et al. Cancer Res., 52:746-748, 1992). However, one clone, 
designated HPK-6, contained a novel sequence. The approximately 200bp cDNA 
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insert of HPK-6 was isolated and radiolabeled with 32 P by conventional techniques for 
use as a probe to isolate longer cDNA sequences encoding the novel protein. 

An oligo-dT-primed human placental cDNA library in plasmid pDC302 has 
been previously described (Larsen et al, J. Exp. Med., 172:1559-1570, 1990). 
5 Briefly, total cell RNA was isolated from whole fresh placental tissue and 

polyadenylated RNA was prepared by chromatography on oligo(dT)-cellulose. 
Double-stranded, oligo(dT)-primed cDNA was prepared with a commercial kit 
(Amersham Corp., Arlington Heights, IL). The resulting cDNA was size fractionated 
by chromatography on Sephacryl S-1000 (Pharmacia Fine Chemicals, Piscataway, 
10 NJ) in 0.5 M sodium acetate. The excluded cDNA was cloned into the Bgfll site of the 
mammalian expresion vector, pDC302 (described below) by an adaptor method similar 
to that described by Haymerle et al. (NucL Acids Res. 14:8615, 1986). Briefly, 
noncomplementary oligonucleotides of the sequence 

5'~GATCTTGGAACGAGACGACCTGCT and S'-AGCAGGTCGTCTCGTTCCAA 

1 5 synthesized on a DNA synthesizer (model 380A; Applied Biosystems, Foster City, 

CA) were annealed and ligated in separate reactions to either cDN A or BgRl cut vector. 
Nonligated oligonucleotides were separated from cDNA or vector by chromatography 
over Sepharose CL-2B (Pharmacia Fine Chemicals) at 65°C in lOmM Tris (pH 8,0), 
O.lmM EDTA. 5 ng of adaptored vector was ligated to adaptored cDNA in 10-jo.l 

20 reactions containing 50 mM sodium chloride, 50 mM Tris-HCl (pH 7.5), 10 mM 
magnesium chloride, 1 mM spermidine, 0.5 mM ATP, 0.1 U/[il T4 polynucleotide 
kinase and 0.4 U/|Lil T4 DNA ligase for 30 min at 37 8 C. Reactions were then desalted 
by drop dialysis on VSWP 013 filters (Millipore Corp., Bedford, MA) against distilled 
water for 40 min immediately before electroporation into Escherichia coli strain DH5a 

25 Transformants were obtained with an average cDNA insert size of 1.6 kb. 

The pDC302 expression vector employed in preparing this cDNA library has 
been described by Mosley et al. {Cell 59:335, 1989). pDC302 is an expression vector 
for use in mammalian host cells, but also replicates in E. coli. 

pDC3()2 was assembled from pDC201 (Sims et al., Science 241:585, 1988), 

30 SV40 and cytomegalovirus DNA and comprises, in order with the direction of 
transcription from the origin of replication: (1) SV40 sequences from coordinates 
5171 -270 including the origin of replication, enhancer sequences and early and late 
promoters; (2) human cytomegalovirus sequences including the promoter and enhancer 
regions (nucleotides 67 1 to 63 from the sequence published by Boechart et al. (Cell 

35 41:521, 1985); (3) adenovirus-2 sequences containing the first exon and part of the 
intron between the first and second exons of the tripartite leader, the second exon and 
pan of the third exon of the tripartite leader; and (4) a multiple cloning site (MCS) 
containing sites for Xhol, Asp7l8, Smal, NotI and Bglll; (5) SV40 sequences from 
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coordinates 4127-4100 and 2770-2533 that include the polyadenylation and termination 
signals for early transcription; (6) sequences derived from pBR322 and virus- 
associated sequences VAI and VAII of pDC201, with adenovirus sequences 10532- 
1 1 156 containing the VAI and VAII genes, followed by pBR322 sequences from 4363- 
5 2486 and 1094-375 containing the ampicillin resistance gene and origin of replication. 
Miniprep DNA isolated from individual pools of colonies from the above- 
described placental cDNA library was digested with Bgl II (which excises the cDNA 
inserts) and screened for hybridization to the HPK-6-derived probe. A clone 
comprising a 2.3 Kb cDNA insert was isolated from one pool (#398) and the insert was 

10 radiolabeled for use in subsequent library screens. 

A random-primed cDNA library in XgtlO was generated from poly(A) + human 
placental RNA and screened for hybridization to the 2.3 kb cDNA probe. One clone 
(32-1), about 4.0 Kb in length, contained an entire coding region, as well as 5' and 3' 
untranslated sequences. 

15 A DNA sequence for human ork is presented in SEQ ID NO:l, along with the 

amino acid sequence encoded thereby. The SEQ ID NO:l sequence was derived by 
combining DNA sequencing information from the fully-sequenced 2.3kbclone with 
that obtained for the 4.0" kb clone (for which certain portions overlapping with the 2.3 
kb clone were not fully sequenced). Beginning with an ATG codon at nucleotides 149- 

20 151 there is an open reading frame extending for 1 124 amino acids. Several lines of 
evidence suggest that this methionine codon is the initiating codon. The ATG codon is 
in a proper context (Kozak, 1984), and is followed by a hydrophobic stretch of amino 
acids that resembles a leader sequence (amino acids 1-18 of SEQ ID NO: 1, i.e., 
methionine through glycine). There are two in-frame termination codons upstream of 

25 the ATG codon. The protein comprises an N- terminal extracellular (ligand-binding) 
domain (amino acids 19-745), followed by a transmembrane region comprising amino 
acids 746-772, and a C-terminal cytoplasmic domain (which contains the tyrosine 
kinase domain responsible for the tyrosine phosphorylating activity of the protein) 
comprising amino acids 773-1 124. 

30 The 2.3 kb clone encodes a C-terminal fragment of ork , including a small C- 

terminal portion of the extracellular domain, followed by the complete transmembrane 
and cytoplasmic domains of the protein (amino acids 698-1 124). The 2.3 kb ork 
cDNA extends from nucleotide 2240 to the 3' end of the SEQ ID NO:l DNA sequence 
and contains additional 3' non-coding sequences (i.e., the 2.3 kb clone has a longer 3' 

35 non-coding region than does the 4.0 kb clone depicted in SEQ ID NO: 1 ). Since the 2.3 
kb clone lacks most of the extracellular domain, the tyrosine kinase encoded thereby is 
not expected to bind a ligand. 
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The 4.0 kb ork cDNA in plasmid pBLUESCRIPT®SK in £. coli strain DH5a 
was deposited with the American Type Culture collection on May 28, 1992 and was 
given accession number ATCC 69003. The strain deposit was made under the terms of 
the Budapest Treaty. 

5 

EXAMPLE 2: Characterization of ork Gene Product 
Rabbit polyclonal antiserum was generated against a peptide consisting of the 
carboxyl -terminal 21 amino acids of the ork protein cytoplasmic domain (SEQ ID NO:l 
sequence) conjugated to ovalbumin (Kitagawa and Aikawa, J. BiochemJ9:233-236 i 

10 1976). Animals were immunized with 100 (ig of the conjugate in complete Freund's 
adjuvant, followed by three boosts in incomplete adjuvant, after which time test bleeds 
were taken and assayed for immunoreactivity towards the peptide immobilized on 
nitrocellulose. Bleeds that showed anti-peptide activity were used in the following 
immunoprecipitation experiment. 

15 To examine the protein encoded by the ork cDNA, the full length cDNA was 

excised from Xgt\0 by Not I digestion and ligated into a Not I-digested expression 
plasmid pDC302 (described above). Not I cleaves pDC302 at a unique site within a 
multiple cloning site. The resulting recombinant vectors were transfected into COS-7 
cells and 3 days later the cells were labelled with 35$-met/cys and detergent solubilized. 

20 The 35 S-labeled lysates were incubated with 5 |j,l of the anti-peptide serum for 1 hour at 
4°C, after which time 300 of pre-swollen proteinA-sepharose was added and the 
lysates were incubated for an additional hour at 4°C. The immune complexes were 
washed 3 times with PBS/1% Triton X-100, boiled in SDS-PAGE sample buffer, and 
analyzed by SDS/polyacrylamide gel electrophoresis. The gel was treated with Amplify 

25 (Amersham, Arlington Heights, IL) and exposed to X-Ray film. 

As shown in Figure 3 (panel A), the immune serum, but not the pre-immune 
serum, precipitated an approximately 140 Kd protein in the lysates from the ork- 
transfected cells, but not in lysates from control cells. Preincubation of the immune 
serum with the peptide specifically blocked the precipitation of the 140 Kd protein. 

30 A hallmark of protein tyrosine kinases is their ability to auto-phosphorylate in 

vitro in immune complexes (Ullrich and Schlessinger, 1990)). We tested the ability of 
the ork protein to autophosphorylate using the rabbit serum described above. Ork- or 
mock-transfected COS-7 cells were washed IX in cold PBS/0. ImM sodium 
orthovanadate and lysed at 10? cells/ml in a buffer containing: 25mM TRIS pH 

35 8/150mM NaCl/lmM EGTA/lmM DTT/1% NP-40/0.1mM sodium 

orthovanadate/1 mM PMSF/10 |Ug/ml leupeptin/10 |ag/ml pepstatin A. Lysates were 
incubated with 5 p.1 of the above-described rabbit anti-peptide serum or normal rabbit 
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serum for 2 hours at 4°C with 15 |il (packed volume) protein A-sepharose. Immune 
complexes were washed 3X with 25mM TRIS pH 8/150mM NaCl/1% Triton X- 
100/0.1% SDS/1% sodium deoxycholate/lmM DTT/OJmM sodium 
orthovanadate/lmM PMSF/ 10 |ig/ml leupeptin, followed by three washes in 20mM 
5 HEPES pH 7.4/1 OmM MnCl2/5mM MgCl2- In vitro phosphorylation reactions were 
initiated by suspending the beads to a final volume of 50 |il in a mixture containing fy- 
32p]ATP (25mM)/10mM MnCl2/5mM MgCl2/20mM HEPES pH 7.4. After 
incubation for 30 minutes at 37°C the beads were washed with 20mM HEPES pH 
7.4/10mM ATP/5mM EGTA, boiled in SDS-PAGE sample buffer and separated on 8- 

10 16% SDS/polyacrylamide gels followed by autoradiography. For phosphoaminoacid 
analysis, separated proteins were electrophoretically transferred to membranes and 
regions containing 32p-labelled protein were excised and incubated with 6N HC1 for 1 
hour at 1 10°C. Hydroly sates were analysed by two dimensional thin-layer 
electrophoresis (Cooper et al., 1983) and autoradiography. 

15 As shown in Figure 3, panel B, immunoprecipitated ork was capable of auto- 

phosphorylation. Phospho-amino acid analysis demonstrated that only phospho- 
tyrosine was present in the immunoprecipitates (Fig. 3, panel C). We also examined 
whether the ork expressed in the COS cells was phosphorylated on tyrosine residues. 
Lysates from ork- and control-transfected cells were immunoprecipated with the PI 

20 (polyclonal) serum, separated by SDS-PAGE, transferred to filters and incubated with 
an anti-phosphotyrosine anti-serum. As shown in Figure 3, panel D, a 140 Kd protein 
containing phosphotyrosine was seen only in the lysates from orfc-transfected cells that 
had been immunoprecipitated with the anti-or* serum. It is unclear from these data 
whether ork has a high level of intrinsic tyrosine kinase activity, whether the COS cells 

25 express a potential ligand for ork which is interacting with the ork protein on the COS 
cell surface, or whether ork is being phosphorylated by a tyrosine kinase expressed by 
COS cells. 

EXAMPLE 3: Screening Human Tissues for ork Expression 
RNAs from several human tissues were screened for ork transcripts. For ork 
30 expression, a filter containing 2|ig of poly(A) + RNA from various human tissues 
(Clontech, San Diego, CA) was hybridized with an antisense RNA probe containing 
nucleotides 2240-3005 (outside of the kinase domain). As shown in Figure 4, a 4.5 
kb mRNA was found at highest levels in placenta and lung, with lower levels detected 
in kidney and heart. Each of these tissues is highly vascularized, suggesting that ork 
35 may be expressed in endothelial cells. Consistent with this, umbilical cord vein 
endothelial cells (HU VECs) also expressed ork message. In addition to the tissues 
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listed in Figure 4, RNAs isolated from hematopoietic cell lines and tissues were 
surveyed, but ork mRNA was not detected. 

EXAMPLE 4: Monoclonal Antibodies Directed Against ork 
5 This example illustrates the preparation of monoclonal antibodies to ork or 

immunogenic fragments thereof. Purified ork can be used to generate anti-ork 
monoclonal antibodies using conventional techniques, for example, those techniques 
described in U.S. Patent 4,411,993. 

In one embodiment of the invention, the peptide used to generate rabbit 

10 polyclonal antiserum in example 2 is used to produce a monoclonal antibody. The 

peptide comprises the.carboxy-terminal 21 amino acids of the ork protein (i.e., the last 
21 amino acids of the cytoplasmic domain in the SEQ ED NO:l sequence) conjugated to 
ovalbumin. Briefly, mice are immunized with the immunogenic peptide emulsified in 
complete Freund's adjuvant, and injected in amounts ranging from 10-100 p,g 

15 subcutaneously or intraperitoneally. Ten to twelve days later, the immunized animals 
are boosted with additional immunogenic peptide emulsified in incomplete Freund's 
adjuvant. Mice are periodically boosted thereafter on a weekly to bi-weekly 
immunization schedule. Serum samples are periodically taken by retro-orbital bleeding 
or tail-tip excision for testing by dot blot assay or ELISA (Enzyme-Linked 

20 Immunosorbent Assay), for ami-ark antibodies. 

Following detection of an appropriate antibody titer, positive animals are 
provided one last intravenous injection of the immunogen in saline. Three to four days 
later, the animals are sacrificed, spleen cells harvested, and spleen cells are fused to a 
murine myeloma cell line (e.g., NS1 or Ag 8.653). Fusions generate hybridoma cells, 

25 which are plated in multiple microtiter plates in a HAT (hypoxanthine, aminopterin and 
thymidine) selective medium to inhibit proliferation of non-fused cells, myeloma 
hybrids, and spleen cell hybrids. 

The hybridoma cells are screened by ELISA for reactivity against purified ork 
peptide by adaptations of the techniques disclosed in Engvall et al., Immunochem. 

30 8:871, 1971 and in U.S. Patent 4,703,004. Positive hybridoma cells can be injected 
intraperitoneally into syngeneic BALB/c mice to produce ascites containing high 
concentrations of ami-ork monoclonal antibodies. Alternatively, hybridoma cells can 
be grown in vitro in flasks or roller bottles by various techniques. Monoclonal 
antibodies produced in mouse ascites can be purified by ammonium sulfate 

35 precipitation, followed by gel exclusion chromatography. Alternatively, affinity 

chromatography based upon binding of antibody to protein A or protein G can also be 
used, as can affinity chromatography based upon binding to ork. 
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EXAMPLE 5: Cloning of Murine ork cDNA 
Murine ork cDNA was isolated by cross-species hybridization using human ork 
cDNA as a probe. A murine cDNA library was prepared from 12 1/2 day old mouse 
embryos. The library was screened by conventional techniques using the radiolabeled 
5 2.3 Kb human ork cDNA described in example 1 as a probe. 

The DNA sequence of the cDNA insert of a hybridizing clone was determined, 
and the amino acid sequence encoded thereby is presented in SEQ ID NO:3 and in 
Figure 6. The murine ork sequence (designated "m"; in the top rows) is aligned with 
the corresponding portion of human ork ("h") in Figure 6. Identical amino acids are 
10 indicated with a line; conservative or similar amino acid changes are indicated with two 
dots or one dot, respectively. The percent similarity of these murine and human ork 
amino acid sequences is 99.2% and the percent identity is 97.5%, as determined by the 
GAP computer program. 

The murine ork sequence corresponds to amino acids 886-1 124 of human ork, 
15 a region falling within the cytoplasmic domain and extending to the C-terminus of the 
human ork protein. The murine cDNA may be used as a probe to screen the same or a 
different murine cDNA library to identify a full length murine ork clone. Alternatively, 
given the similarity of the human and murine sequences, murine libraries may be 
screened using a probe derived from the extracellular region of human ork. 
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SEQUENCE LISTING 



PCT/US93/06093 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Ziegler, Steven F. 
(ii) TITLE OF INVENTION: NOVEL TYROSINE KINASE 
10 (iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Immunex Corporation 

(B) STREET: 51 University Street 
15 (C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: US 

(F) ZIP: 98101 

20 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



25 



30 



35 



40 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/905,600 

(B) FILING DATE: 26-JUN-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Seese, Kathryn A. 

(B) REGISTRATION NUMBER: 32,172 

(C) REFERENCE /DOCKET NUMBER: 2609 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 587-0430 

(B) TELEFAX: (206) 233-0644 

(C) TELEX: 756822 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 4138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA to mRNA 



(i:-:) FEATURE: 

(A) NAME /KEY : CDS 
55 (B) LOCATION: 149.. 3523 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG ACGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 12 0 



60 



BNSDOCID <WO 9400469A1> 



WO 94/00469 



PCT/US93/06093 



29 



GAAACTGGAT GGAGAGATTT GGGGAAGC ATG GAC TCT TTA GCC AGC TTA GTT 172 

Met Asp Ser Leu Ala Ser Leu Val 
1 5 

5 

CTC TGT GGA GTC AGC TTG CTC CTT TCT GGA ACT GTG GAA GGT GCC ATG 220 
Leu Cys Gly Val Ser Leu Leu Leu Ser Gly Thr Val Glu Gly Ala Met 
10 15 20 

10 GAC TTG ATC TTG ATC AAT TCC CTA CCT CTT GTA TCT GAT GCT GAA ACA 2 68 

Asp Leu lie Leu lie Asn Ser Leu" Pro Leu Val Ser Asp Ala Glu Thr 
25 30 35 40 

TCT CTC ACC TGC ATT GCC TCT GGG TGG CGC CCC CAT GAG CCC ATC ACC 316 
15 Ser Leu Thr Cys lie Ala Ser Gly Trp Arg Pro His Glu Pro lie Thr 

45 50 55 

ATA GGA AGG GAC TTT GAA GCC TTA ATG AAC CAG CAC CAG GAT CCG CTG 364 
lie Gly Arg Asp Phe Glu Ala Leu Met Asn Gin His Gin Asp Pro Leu 
20 60 65 70 

GAA GTT ACT CAA GAT GTG ACC AGA GAA TGG GCT AAA AAA GTT GTT TGG 412 

Glu Val Thr Gin Asp Val Thr Arg Glu Trp Ala Lys Lys Val Val Trp 

75 80 85 

25 

AAG AGA GAA AAG GCT AGT AAG ATC AAT GGT GCT TAT TTC TGT GAA GGG 4 60 

Lys Arg Glu Lys Ala Ser Lys lie Asn Gly Ala Tyr Phe Cys Glu Gly 

90 95 100 

30 CGA GTT CGA GGA GAG GCA ATC AGG ATA CGA ACC ATG AAG ATG CGT CAA 50 8 

Arg Val Arg Gly Glu Ala He Arg He Arg Thr Met Lys Met Arg Gin 
105 110 115 120 

CAA GCT TCC TTC CTA CCA GCT ACT TTA ACT ATG ACT GTG GAC AAG GGA 55 6 

35 Gin Ala Ser Phe Leu Pro Ala Thr Leu Thr Met Thr Val Asp Lys Gly 

125 130 135 

GAT AAC GTG AAC ATA TCT TTC AAA AAG GTA TTG ATT AAA GAA GAA GAT 60 4 

Asp Asn Val Asn He Ser Phe Lys Lys Val Leu He Lys Glu Glu Asp 
40 140 145 150 

GCA GTG ATT TAC AAA AAT GGT TCC TTC ATC CAT TCA GTG CCC CGG CAT 652 

Ala Val He Tyr Lys Asn Gly Ser Phe He His Ser Val Pro Arg His 
155 160 165 

45 

GAA GTA CCT GAT ATT CTA GAA GTA CAC CTG CCT CAT GCT CAG CCC CAG 70 0 

Glu Val Pro Asp He Leu Glu Val His Leu Pro His Ala Gin Pro Gin 
170 175 180 

50 GAT GCT GGA GTG TAC TCG GCC AGG TAT ATA GGA GGA AAC CTC TTC ACC 7 48 

Asp Ala Gly Val Tyr Ser Ala Arg Tyr He Gly Gly Asn Leu Phe Thr 
185 * 190 195 200 

TCG GCC TTC ACC AGG CTG ATA GTC CGG AGA TGT GAA GCC CAG AAG TGG 7 96 

55 Ser Ala Phe Thr Arg Leu lie Val Arg Arg Cys Glu Ala Gin Lys Trp 

205 210 "* 215 

GGA CCT GAA TGC AAC CAT CTC TGT ACT GCT TGT ATG AAC AAT GGT GTC 844 
Gly Pro Glu Cys Asn His Leu Cys Thr Ala Cys Met Asn Asn Gly Val 
60 220 2*25 230 
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TGC CAT GAA GAT ACT GGA GAA TGC ATT TGC CCT CCT GGG TTT ATG GGA 8 92 

Cys His Glu Asp Thr Gly Glu Cys He Cys Pro Pro Gly Phe Met Gly 
235 240 245 

5 AGG ACG TGT GAG AAG GCT TGT GAA CTG CAC ACG TTT GGC AG A ACT TGT 940 
Arg Thr Cys Glu Lys Ala Cys Glu Leu His Thr Phe Gly Arg Thr Cys 
250 255 260 

AAA GAA AGG TGC AGT GGA CAA GAG GGA TGC AAG TCT TAT GTG TTC TGT 988 
10 Lys Glu Arg Cys Ser Gly Gin Glu Gly Cys Lys Ser Tyr Val Phe Cys 
265 270 275 280 

CTC CCT GAC CCC TAT GGG TGT TCC TGT GCC AC A GGC TGG AAG GGT CTG 103 6 

Leu Pro Asp Pro Tyr Gly Cys Ser Cys Ala Thr Gly Trp Lys Gly Leu 
15 285 290 295 

CAG TGC AAT GAA GCA TGC CAC CCT GGT TTT TAC GGG CCA GAT TGT AAG 108 4 

Gin Cys Asn Glu Ala Cys His Pro Gly Phe Tyr Gly Pro Asp Cys Lys 
300 " 305 310 

20 

CTT AGG TGC AGC TGC AAC AAT GGG GAG ATG TGT GAT CGC TTC CAA GGA 1132 
Leu Arg Cys Ser Cys Asn Asn Gly Glu Met Cys Asp Arg Phe Gin Gly 
315 320 325 

25 TGT CTC TGC TCT CCA GGA TGG CAG GGG CTC CAG TGT GAG AGA GAA GGC 1180 
Cys Leu Cys Ser Pro Gly Trp Gin Gly Leu Gin Cys Glu Arg Glu Gly 
330 335 340 

ATA CCG AGG ATG ACC CCA AAG ATA GTG GAT TTG CCA GAT CAT ATA GAA 1228 
30 He Pro Arg Met Thr Pro Lys He Val Asp Leu Pro Asp His He Glu 
345 350 355 360 

GTA AAC AGT GGT AAA TTT AAT CCC ATT TGC AAA GCT TCT GGC TGG CCG 127 6 

Val Asn Ser Gly Lys Phe Asn Pro He Cys Lys Ala Ser Gly Trp Pro 
35 365 370 375 

CTA CCT ACT AAT GAA GAA ATG ACC CTG GTG AAG CCG GAT GGG ACA GTG 132 4 

Leu Pro Thr Asn Glu Glu Met Thr Leu Val Lys Pro Asp Gly Thr Val 
380 385 390 

40 

CTC CAT CCA AAA GAC TTT AAC CAT ACG GAT CAT TTC TCA GTA GCC ATA 1372 

Leu His Pro Lys Asp Phe Asn His Thr Asp His Phe Ser Val Ala He 
395 400 405 

45 TTC ACC ATC CAC CGG ATC CTC CCC CCT GAC TCA GGA GTT TGG GTC TGC 1420 
Phe Thr He His Arg He Leu Pro Pro Asp Ser Gly Val Trp Val Cys 
410 415 420 

AGT GTG AAC ACA GTG GCT GGG ATG GTG GAA AAG CCC TTC AAC ATT TCT 14 68 

50 Ser Val Asn Thr Val Ala Gly Met Val Glu Lys Pro Phe Asn He Ser 
425 430 435 440 

GTT AAA GTT CTT CCA AAG CCC CTG AAT GCC CCA AAC GTG ATT GAC ACT 1516 
Val Lys Val Leu Pro Lys Pro Leu Asn Ala Pro Asn Val He Asp Thr 
55 445 450 455 

GGA CAT AAC TTT GCT GTC ATC AAC ATC AGC TCT GAG CCT TAC TTT GGG 1564 

Gly His Asn Phe Ala Val lie Asn lie Ser Ser Glu Pro Tyr Phe Gly 
460 465 470 

60 
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GAT GGA CCA ATC AAA TCC AAG AAG CTT CTA TAC AAA CCC GTT AAT CAC 1612 

Asp Gly Pro lie Lys Ser Lys Lys Leu Leu Tyr Lys Pro Val Asn His 
475 480 485 



5 TAT GAG GCT TGG CAA CAT ATT CAA GTG ACA AAT GAG ATT GTT ACA CTC 16 60 

Tyr Glu Ala Trp Gin His He Gin Val Thr Asn Glu He Val Thr Leu 
490 495 500 

AAC TAT TTG GAA CCT CGG ACA GAA TAT GAA CTC TGT GTG CAA CTG GTC 17 08 

10 Asn Tyr Leu Glu Pro Arg Thr Glu Tyr Glu Leu Cys Val Gin Leu Val 
505 510 515 ' 520 



CGT CGT GGA GAG GGT GGG GAA GGG CAT CCT GGA CCT GTG AGA CGC TTC 17 5 6 

Arg Arg Gly Glu Gly Gly Glu Gly His Pro Gly Pro Val Arg Arg Phe 
15 525 530 /• 535 



20 



ACA ACA GCT TCT ATC GGA CTC CCT CCT CCA AGA GGT CTA AAT CTC CTG 1804 

Thr Thr Ala Ser lie Gly Leu Pro Pro Pro Arg Gly Leu Asn Leu Leu 

540 545 550 

CCT AAA AGT CAG ACC ACT CTA AAT TTG ACC TGG CAA CCA ATA TTT CCA 18 52 

Pro Lys Ser Gin Thr Thr Leu Asn Leu Thr Trp Gin Pro He Phe Pro 

555 560 565 

25 AGC TCG GAA GAT GAC TTT TAT GTT GAA GTG GAG AGA AGG TCT GTG CAA 1900 

Ser Ser Glu Asp Asp Phe Tyr Val Glu Val Glu Arg Arg Ser Val Gin 

570 575 580 



AAA AGT GAT CAG CAG AAT ATT AAA GTT CCA GGC AAC TTG ACT TCG GTG 194 8 

30 Lys Ser Asp Gin Gin Asn He Lys Val Pro Gly Asn Leu Thr Ser Val 

585 590 " 595 600 

CTA CTT AAC AAC TTA CAT CCC AGG GAG CAG TAC GTG GTC CGA GCT AGA 1996 

Leu Leu Asn Asn Leu His Pro Arg Giu Gin Tyr Val Val Arg Ala Arg 

35 605 610 * 615 



40 



GTC AAC ACC AAG GCC CAG GGG GAA TGG AGT GAA GAT CTC ACT GCT TGG 204 4 

Val Asn Thr Lys Ala Gin Gly Glu Trp Ser Glu Asp Leu Thr Ala Trp 

620 ~ 625 630 

ACC CTT AGT GAC ATT CTT CCT CCT CAA CCA GAA AAC ATC AAG ATT TCC 2 0 92 

Thr Leu Ser Asp He Leu Pro Pro Gin Pro Glu Asn He Lys He Ser 

635 640 645 

45 AAC ATT ACA CAC TCC TCG GCT GTG ATT TCT TGG ACA ATA TTG GAT GGC 214 0 

Asn He Thr His Ser Ser Ala Val He Ser Trp Thr He Leu Asp Gly 

650 655 660 

TAT TCT ATT TCT TCT ATT ACT ATC CGT TAC AAG GTT CAA GGC AAG AAT 2188 

50 Tyr Ser He Ser Ser He Thr He Arg Tyr Lys Val Gin Gly Lys Asn 

665 670 675 680 

GAA GAC CAG CAC GTT GAT GTG AAG ATA AAG AAT GCC ACC ATC ATT CAG 223 6 

Glu Asp Gin His Val Asp Val Lys He Lys Asn Ala Thr He He Gin 

55 685 690 695 

TAT CAG CTC AAG GGC CTA GAG CCT GAA ACA GCA TAC CAG GTG GAC ATT 22 8 4 

Tyr Gin Leu Lys Gly Leu Glu Pro Giu Thr Ala Tyr Gin Val Asp He 

700 " 705 710 

60 
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TTT GCA GAG AAC AAC ATA GGG TCA AGC AAC CCA GCC TTT TCT CAT GAA 2332 

Phe Ala Glu Asn Asn lie Gly Ser Ser Asn Pro Ala Phe Ser His Glu 
715 " "20 725 

5 CTG GTG ACC CTC CCA GAA TCT CAA GCA CCA GCG GAC CTC GGA GGG GGG 2380 

Leu Val Thr Leu Pro Glu Ser Gin Ala Pro Ala Asp Leu Gly Gly Gly 
730 735 740 

AAG ATG CTG CTT ATA GCC ATC CTT GGC TCT GCT GGA ATG ACC TGC CTG 2 42 8 

10 Lys Met Leu Leu, He Ala He Leu Gly Ser Ala Gly Met Thr Cys Leu 
745 ' 750 755 760 

ACT GTG CTG TTG GCC TTT CTG ATC ATA TTG CAA TTG AAG AGG GCA AAT 2 47 6 

Thr Val Leu Leu Ala Phe Leu He He Leu Gin Leu Lys Arg Ala Asn 
15 765 770 ~ 775 

GTG CAA AGG AG A ATG GCC CAA GCC TTC CAA AAC GTG AGG GAA GAA CCA 2 52 4 

Val Gin Arg Arg Met Ala Gin Ala Phe Gin Asn Val Arg Glu Glu Pro 
780 785 790 

20 

GCT GTG CAG TTC AAC TCA GGG ACT CTG GCC CTA AAC AGG AAG GTC AAA 2 572 

Ala Val Gin Phe Asn Ser Gly Thr Leu Ala Leu Asn Arg Lys Val Lys 
795 800 805 

25 AAC AAC CCA GAT CCT AC A ATT TAT CCA GTG CTT GAC TGG AAT GAC ATC 2 620 

Asn Asn Pro Asp Pro Thr He Tyr Pro Val Leu Asp Trp Asn Asp He 
810 815 820 

AAA TTT CAA GAT GTG ATT GGG GAG GGC AAT TTT GGC CAA GTT CTT AAG 26 68 

30 Lys Phe Gin Asp Val He Gly Glu Gly Asn Phe Gly Gin Val Leu Lys 
825 830 835 840 

GCG CGC ATC AAG AAG GAT GGG TTA CGG ATG GAT GCT GCC ATC AAA AGA 2716 

Ala Arg He Lys Lys Asp Gly Leu Arg Met Asp Ala Ala lie Lys Arg 
35 845 850 855 

ATG AAA GAA TAT GCC TCC AAA GAT GAT CAC AGG GAC TTT GCA GGA GAA 27 64 

Met Lys Glu Tyr Ala Ser Lys Asp Asp His Arg Aso Phe Ala Gly Glu 
860 865 * 870 

40 

CTG GAA GTT CTT TGT AAA CTT GGA CAC CAT CCA AAC ATC ATC AAT CTC 2 812 

Leu Glu Val Leu Cys Lys Leu Gly His His Pro Asn He He Asn Leu 
875 880 885 

45 TTA GGA GCA TGT GAA CAT CGA GGC TAC TTG TAC CTG GCC ATT GAG TAC 2 8 60 

Leu Gly Ala Cys Glu His Arg Gly Tyr Leu Tyr Leu Ala He Glu Tyr 
890 895 900 

GCG CCC CAT GGA AAC CTT CTG GAC TTC CTT CGC AAG AGC CGT GTG CTG 2 908 

50 Ala Pro His Gly Asn Leu Leu Asp Phe Leu Arg Lys Ser Arg Val Leu 
905 910 915 920 

GAG ACG GAC CCA GCA TTT GCC ATT GCC AAT AGC ACC GCG TCC ACA CTG 2 95 6 

Glu Thr Asp Pro Ala Phe Ala He Ala Asn Ser Thr Ala Ser Thr Leu 
55 925 930 935 

TCC TCC CAG CAG CTC CTT CAC TTC GCT GCC GAC GTG GCC CGG GGC ATG 3004 

Ser Ser Gin Gin Leu Leu His Phe Ala Ala Asp Val Ala Arg Gly Met 
940 945 950 

60 
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GAC TAC TTG AGC CAA AAA CAG TTT ATC CAC AGG GAT CTG GCT GCC AGA 
Asp Tyr Leu Ser Gin Lys Gin Phe lie His Arg Asp Leu Ala Ala Arg 
955 960 965 

AAC ATT TTA GTT GGT GAA AAC TAT GTG GCA AAA ATA GCA GAT TTT GGA 
Asn lie Leu Val Gly Glu Asn Tyr Val Ala Lys lie Ala Asp Phe Gly 
970 975 980 

TTG TCC CGA GGT CAA GAG GTG TAC GTG AAA AAG ACA ATG GGA AGG CTC 
Leu Ser Arg Gly Gin Glu Val Tyr Val Lys Lys Thr Met Gly Arg Leu 
985 990 995 1000 

CCA GTG CGC TGG ATG GCC ATC GAG TCA CTG AAT TAC AGT GTG TAC ACA 
Pro Val Arg Trp Met Ala He Glu Ser Leu Asn Tyr Ser Val Tyr Thr 
1005 1010 1015 

ACC AAC AGT GAT GTA TGG TCC TAT GGT GTG TTA CTA TGG GAG ATT GTT 
Thr Asn Ser Asp Val Trp Ser Tyr Gly Val Leu Leu Trp Glu He Val 
1020 1025 1030 

AGC TTA GGA GGC ACA CCC TAC TGC GGG ATG ACT TGT GCA GAA CTC TAC 
Ser Leu Gly Gly Thr Pro Tyr Cys Gly Met Thr Cys Ala Glu Leu Tyr 
1035 1040 ' 1045 

GAG AAG CTG CCC CAG GGC TAC AGA CTG GAG AAG CCC CTG AAC TGT GAT 
Glu Lys Leu Pro Gin Gly Tyr Arg Leu Glu Lys Pro Leu Asn Cys Asp 
1050 1055 1060 . 

GAT GAG GTG TAT GAT CTA ATG AGA CAA TGC TGG CGG GAG AAG CCT TAT 
Asp Glu Val Tyr Asp Leu Met Arg Gin Cys Trp Arg Glu Lys Pro Tyr 
1065 1070 1075 1080 

GAG AGG CCA TCA TTT GCC CAG ATA TTG GTG TCC TTA AAC AGA ATG TTA 
Glu Arg Pro Ser Phe Ala Gin He Leu Val Ser Leu Asn Arg Met Leu 
1085 1090 1095 

GAG GAG CGA AAG ACC TAC GTG AAT ACC ACG CTT TAT GAG AAG TTT ACT 
Glu Glu Arg Lys Thr Tyr Val Asn Thr Thr Leu Tyr Glu Lys Phe Thr 
1100 1105 1110 

TAT GCA GGA ATT GAC TGT TCT GCT GAA GAA GCG GCC TAGGACAGAA 
Tyr Ala Gly He Asp Cys Ser Ala Glu Glu Ala Ala 

1115 1120 112 

CATCTGTATA CCCTCTGTTT CCCTTTCACT GGCATGGGAG ACCCTTGACA ACTGCTGAGA 

AAACATGCCT CTGCCAAAGG ATGTGATATA TAAGTGTACA TATGTGCTGG AATTCTAACA 

AGTCATAGGT TAATATTTAA GACACTGAAA AATCTAAGTG ATATAAATCA GATTCTTCTC 

TCTCATTTTA TCCCTCACCT GTAGCATGCC AGTCCCGTTT CATTTAGTCA TGTGACCACT 

CTGTCTTGTG TTTCCACAGC CTGCAAGTTC AGTCCAGGAT GCTAACATCT AAAAATAGAC 

TTAAATCTCA TTGCTTACAA GCCTAAGAAT CTTTAGAGAA GTATACATAA GTTTAGGATA 

AAATAATGGG ATTTTCTTTT CTTTTCTCTG GTAATATTGA CTTGTATATT TTAAGAAATA 

ACAGAAAGCC TGGGTGACAT TTGGGAGACA TGTGACATTT ATATATTGAA TTAATATCCC 



TACATGTATT GCACATTGTA AAAAGTTTTA GTTTTGATGA GTTGTGAGTT TACCTTGTAT 
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ACTGTAGGCA CACTTTGCAC TGATATATCA TGAGTGAATA AATGTCTTGC CTACTCAAAA 4130 
AAAAAAAA 41 38 

5 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 112 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



20 



35 



50 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Aso Ser Leu Ala Ser Leu Val Leu Cys Gly Val Ser Leu Leu Leu 

1*5 10 15 

Ser Gly Thr Val Glu Gly Ala Met Asp Leu lie Leu lie Asn Ser Leu 

20 " 25 30 



Pro Leu Val Ser Asp Ala Glu Thr Ser Leu Thr Cys lie Ala Ser Gly 

25 35 40 45 

Trp Arg Pro His Glu Pro lie Thr lie Gly Arg Asp Phe Glu Ala Leu 

50 55 60 

30 Met Asn Gin His Gin Asp Pro Leu Glu Val Thr Gin Asp Val Thr Arg 

65 70 75 80 



Glu Trp Ala Lys Lys Val Val Trp Lys Arg Glu Lys Ala Ser Lys lie 

85 90 95 

Asn Gly Ala Tyr Phe Cvs Glu Gly Arg Val Arg Gly Glu Ala lie Arg 
100 " 105 110 



lie Arg Thr Met Lys Met Arg Gin Gin Ala Ser Phe Leu Pro Ala Thr 

40 115 120 125 

Leu Thr Met Thr Val Asp Lys Gly Asp Asn Val Asn lie Ser Phe Lys 

130 135 " 140 

45 Lys Val Leu lie Lys Glu Glu Asp Ala Val lie Tyr Lys Asn Gly Ser 

145 150 155 160 



Phe lie His Ser Val Pro Arg His Glu Val Pro Asp lie Leu Glu Val 
165 170 175 

His Leu Pro His Ala Gin Pro Gin Asp Ala Gly Val Tyr Ser Ala Arg 
180 185 190 



Tyr lie Gly Gly Asn Leu Phe Thr Ser Ala Phe Thr Arg Leu He Val 

55 195 200 205 

Arg Arg Cys Glu Ala Gin Lys Trp Gly Pro Glu Cys Asn His Leu Cys 

210 "* 215 * 220 

60 Thr Ala Cys Met Asn Asn Gly Val Cys His Glu Asp Thr Gly Glu Cys 

225 230 235 240 
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35 



10 



25 



40 



55 



lie Cys Pro Pro Gly Phe Met Gly Arg Thr Cys Glu Lys Ala Cys Glu 
245 250 255 

Leu His Thr Phe Gly Arg Thr Cys Lys Glu Arg Cys Ser Gly Gin Glu 
260 265 270 

Gly Cys Lys Ser Tyr Val Phe Cys Leu Pro Asp Pro Tyr Gly Cys Ser 
275 280 285 

Cys Ala Thr Gly Trp Lys Gly Leu Gin Cys Asn Glu Ala Cys His Pro 
290 295 300 



Gly Phe Tyr Gly Pro Asp Cys Lys Leu Arg Cys Ser Cys Asn Asn Gly 

15 305 310 315 320 

Glu Met Cys Asp Arg Phe Gin Gly Cys Leu Cys Ser Pro Gly Trp Gin 

325 330 335 

20 Gly Leu Gin Cys Glu Arg Glu Gly lie Pro Arg Met Thr Pro Lys lie 

340 345 350 



Val Asp Leu Pro Asp His lie Glu Val Asn Ser Gly Lys Phe Asn Pro 

355 360 365 

lie Cys Lys Ala Ser Gly Trp Pro Leu Pro Thr Asn Glu Glu Met Thr 

370 375 380 



Leu Val Lys Pro Asp Gly Thr Val Leu His Pro Lys Aso Phe Asn His 

30 385 390 395 * 400 

Thr Asp His Phe Ser Val Ala He Phe Thr He His Arg He Leu Pro 

405 410 415 

35 Pro Asp Ser Gly Val Trp Val Cys Ser Val Asn Thr Val Ala Gly Met 

420 425 430 



Val Glu Lys Pro Phe Asn He Ser Val Lys Val Leu Pro Lys Pro Leu 
435 440 445 

Asn Ala Pro Asn Val He Asp Thr Gly His Asn Phe Ala Val He Asn 
450 455 460 



He Ser Ser Glu Pro Tyr Phe Gly Asp Gly Pro He Lys Ser Lys Lys 
45 '565 470 475 480 

Leu Leu Tyr Lys Pro Val Asn His Tyr Glu Ala Trp Gin His He Gin 
485 490 495 

50 Val Thr Asn Glu He Val Thr Leu Asn Tyr Leu Glu Pro Arg Thr Glu 
500 505 510 



Tyr Glu Leu Cys Val Gin Leu Val Arg Arg Gly Glu Glv Gly Glu Gly 

515 520 525 

His Pro Gly Pro Val Arg Arg Phe Thr Thr Ala Ser lie Gly Leu Pro 

530 535 540 



Pro Pro Arg Gly Leu Asn Leu Leu Pro Lys Ser Gin Thr Thr Leu Asn 
60 545 550 555 560 
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Leu Thr Tro Gin Pro He Phe Pro Ser Ser Glu Asp Asp Phe Tyr Val 
565 570 575 

Glu Val Glu Arg Arg Ser Val Gin Lys Ser Asp Gin Gin Asn He Lys 
5 580 585 590 

Val Pro Gly Asn Leu Thr Ser Val Leu Leu Asn Asn Leu His Pro Arg 
595 600 605 

10 Glu Gin Tyr Val Val Arg Ala Arg Val Asn Thr Lys Ala Gin Gly Glu 
610 615 620 



15 



30 



45 



Trp Ser Glu Asp Leu Thr Ala Trp Thr Leu Ser Asp He Leu Pro Pro 

625 630 635 640 

Gin Pro Glu Asn He Lys He Ser Asn He Thr His Ser Ser Ala Val 

645 650 655 



He Ser Trp Thr He Leu Asp Gly Tyr Ser He Ser Ser He Thr He 

20 660 665 670 

Arg Tyr Lys Val Gin Gly Lys Asn Glu Asp Gin His Val Asp Val Lys 

675 680 685 

25 He Lys Asn Ala Thr He lie Gin Tyr Gin Leu Lys Gly Leu Glu Pro 

690 695 700 



Glu Thr Ala Tyr Gin Val Asp lie Phe Ala Glu Asn Asn lie Gly Ser 
705 710 715 720 

Ser Asn Pro Ala Phe Ser His Glu Leu Val Thr Leu Pro Glu Ser Gin 
725 730 735 



Ala Pro Ala Asp Leu Gly Gly Gly Lys Met Leu Leu lie Ala He Leu 

35 740 ' 745 750 

Gly Ser Ala Gly Met Thr Cys Leu Thr Val Leu Leu Ala Phe Leu He 

755 - 760 765 

40 He Leu Gin Leu Lys Arg Ala Asn Val Gin Arg Arg Met Ala Gin Ala 

770 " 775 780 



Phe Gin Asn Val Arg Glu Glu Pro Ala Val Gin Phe Asn Ser Gly Thr 
785 790 795 800 

Leu Ala Leu Asn Arg Lys Val Lys Asn Asn Pro Asp Pro Thr He Tyr 
805 810 815 



Pro Val Leu Asp Trp Asn Asp He Lys Phe Gin Asp Val He Gly Glu 
50 820 825 830 

Gly Asn Phe Gly Gin Val Leu Lys Ala Arg lie Lys Lys Asp Gly Leu 
835 840 845 

55 Arg Met Asp Ala Ala lie Lys Arg Met Lys Glu Tyr Ala Ser Lys Asp 

850 855 ** 860 



60 



Asp His Arg Asp Phe Ala Gly Glu Leu Glu Val Leu Cys Lys Leu Gly 
865 * 870 875 880 
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His His Pro Asn lie lie Asn Leu Leu Gly Ala Cys Glu His Arg Gly 
885 890 895 

Tyr Leu Tyr Leu Ala lie Glu Tyr Ala Pro His Gly Asn Leu Leu Asp 
5 900 905 910 

Phe Leu Arg Lys Ser Arg Val Leu Glu Thr Asp Pro Ala Phe Ala lie 
915 920 925 

10 Ala Asn Ser Thr Ala Ser Thr Leu Ser Ser Gin Gin Leu Leu His Phe 
930 935 940 



15 



30 



45 



Ala Ala Asp Val Ala Arg Gly Met Asp Tyr Leu Ser Gin Lys Gin Phe 

945 950 955 960 

lie His Arg Asp Leu Ala Ala Arg Asn lie Leu Val Gly Glu Asn Tyr 

965 970 975 



Val Ala Lys lie Ala Asp Phe Gly Leu Ser Arg Gly Gin Glu Val Tyr 
20 980 985 990 

Val Lys Lys Thr Met Gly Arg Leu Pro Val Arg Trp Met Ala lie Glu 
995 1000 1005 

25 Ser Leu Asn Tyr Ser Val Tyr Thr Thr Asn Ser Asp Val Trp Ser Tyr 

1010 1015 1020 



Gly Val Leu Leu Trp Glu lie Val Ser Leu Gly Gly Thr Pro Tyr Cys 
1025 1030 1035 1040 

Gly Met Thr Cys Ala Glu Leu Tyr Glu Lys Leu Pro Gin Gly Tyr Arg 
1045 1050 1055 



Leu Glu Lys Pro Leu Asn Cys Asp Asp Glu Val Tyr Asp Leu Met Arg 
35 1060 1065 1070 

Gin Cys Trp Arg Glu Lys Pro Tyr Glu Arg Pro Ser Phe Ala Gin lie 
1075 1080 1085 

40 Leu Val Ser Leu Asn Arg Met Leu Glu Glu Arg Lys Thr Tyr Val Asn 

1090 1095 1100 



Thr Thr Leu Tyr Glu Lys Phe Thr Tyr Ala Gly lie Asp Cys Ser Ala 
1105 1110 1115 1120 

Glu Glu Ala Ala 



50 (2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Val Pro TrD Leu Gly Ala Cys Glu His Arg Gly Tyr Leu Tyr Leu Ala 
1 ' 5 ' 10 15 

5 

He Glu Tyr Ala Pro His Gly Asn Leu Leu Asp Phe Leu Arg Lys Ser 
20 25 30 

Arg Val Leu Glu Thr Asp Pro Ala Phe Ala He Ala Asn Ser Thr Ala 
10 35 40 45 

Ser He Met Ser Ser Gin Gin Leu Leu His Phe Ala Ala Asp Val Ala 
50 55 60 

15 Arg Gly Met Asp Tyr Leu Ser Gin Lys Gin Phe He His Arg Asp Leu 

65 70 75 80 



20 



35 



Ala Ala Arg Asn He Leu Val Gly Glu Asn Tyr lie Ala Lys He Ala 
85 90 95 

Asp Phe Gly Leu Ser Arg Gly Gin Glu Val Tyr Val Lys Lys Thr Met 
100 105 110 



Glv Arg Leu Pro Val Arg Trp Met Ala He Glu Ser Leu Asn Tyr Ser 

25 " 115 120 125 

Val Tyr Thr Thr Asn Ser Asp Val Trp Ser Tyr Gly Val Leu Leu Trp 

130 135 140 

30 Glu He Val Ser Leu Gly Gly Thr Pro Tyr Cys Gly Met Thr Cys Ala 

145 150 155 160 



Glu Leu Tyr Glu Lys Leu Pro Gin Gly Tyr Arg Leu Glu Lys Pro Leu 

165 170 175 

Asn Cys Asp Asp Glu Val Tyr Asp Leu Met Arg Gin Cys Trp Arg Glu 

180 " 185 190 



Lys Pro Tyr Glu Arg Pro Ser Phe Ala Gin He Leu Val Ser Leu Asn 
40 ^ 195 200 205 

Arg Met Leu Glu Glu Arg Lys Thr Tyr Val Asn Thr Thr Leu Tyr Glu 

210 215 220 

45 Lys Phe Thr Tyr Ala Gly He Asp Cys Ser Ala Glu Glu Ala Ala 

225 230 235 
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What is claimed is: 

5 1 . An isolated DNA sequence encoding a biologically active human ork 

polypeptide, wherein the DNA sequence is selected from the group consisting of: 

(a) DNA comprising the coding region of the DNA sequence of SEQ ID 

NO:l; 

(b) DNA sequences that hybridize under moderately stringent conditions to 
10 the DNA of (a) and which encode a biologically active ork; and 

(c) DNA sequences that are degenerate as a result of the genetic code to a DNA 
sequence of (a) or (b), and which encode a biologically active orL 



2 . An isolated DNA sequence according to claim 1 wherein the ork 
15 polypeptide is a soluble human o rk polypeptide. 

3 . An isolated DNA sequence according to claim 2, wherein said soluble 
human ork polypeptide comprises amino acids 19-745 of the amino acid sequence of 
SEQ ID NO:l. 

20 

4. An isolated DNA sequence comprising a nucleotide sequence selected from 
the group consisting of nucleotides 149-3520, 203-3520, 149-2383, and 203-2383 of 
SEQ ID NO:l. 

25 5 . An expression vector comprising a DNA sequence according to claim 1. 

6. An expression vector comprising a DNA sequence according to claim 2. 

7 . An expression vector comprising a DNA sequence according to claim 3. 

30 

8. An expression vector comprising a DNA sequence according to claim 4. 

9. A process for preparing an ork polypeptide, comprising culturing a host cell 
transformed with an expression vector according to claim 5 under conditions promoting 

35 expression of ork, and recovering the ork polypeptide from the culture. 
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10. A process for preparing an ork polypeptide, comprising culturing a host cell 
transformed with an expression vector according to claim 6 under conditions promoting 
expression of ork and recovering the ork polypeptide from the culture. 



5 1 1 . A process for preparing an ork polypeptide, comprising culturing a host cell 

transformed with an expression vectoraccording to claim 7 under conditions promoting 
expression of ork, and recovering the ork polypeptide from the culture. 

1 2. A process for preparing an ork polypeptide, comprising culturing a host cell 
10 transformed with an expression vector according to claim 8 under conditions promoting 
expression of ork, and recovering the ork polypeptide from the culture. 



13. A substantially homogeneous purified biologically active ork protein 
encoded by a DNA sequence selected from the group consisting of: 

1 5 (a) DNA comprising the coding region of the DNA sequence of SEQ ID 

NO:l; 

(b) DNA sequences that hybridize under moderately stringent conditions to 
the DNA of (a) and which encode a biologically active ork; and 

(c) DNA sequences that are degenerate as a result of the genetic code to a 
20 DNA sequence of (a) or (b), and which encode a biologically active ork, 

14. A purified ork according to claim 13, wherein said ork is human ork. 



15. A purified ork according to claim 14, wherein said ork is a soluble human 

25 ork. 



16. A purified human ork protein comprising an amino acid sequence selected 
from the group consisting of amino acids 19-1 124 of SEQ ID NO:2 and amino acids 
19-745 ofSEQIDNO:2. 

30 

17. A purified ork protein comprising an extracellular domain that comprises 
(from N- to C-terminus) an immunoglobulin-like domain, three EGF-like cysteine 
repeats, a sequence containing a pair of cysteine residues, and three FNIII-like repeats, 
wherein the amino acid sequence of said extracellular domain is at least 80% identical to 

35 the sequence presented as amino acids 19-745 of SEQ ID NO:l. 

18. A purified ork protein according to claim 17, wherein said protein 
additionally contains a transmembrane region and a cytoplasmic domain. 
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19. An antibody immunoreactive with orlc or an ork immunogen. 

20. An antisense or sense oligonucleotide that can inhibit transcription or 

5 translation of ork, comprising a sequence of at least about 14 nucleotides of the DNA 
sequence of SEQ ID NO: 1, or its DNA or RNA complement. 
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